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5 (57) Abstract: The present invention is directed to novel nucleic acids that encode novel HDAC proteins, which have an affect on 
chromating structure and transcription. Also provided herein are vectors and host cells comprising those nucleic acid sequences, 
chimeric polypeptide molecules comprising the polypeptides of the present invention fused to heterologous polypeptide sequences, 

Q antibodies which bind to the polypeptides of the present invention and to methods of producing the polypeptides of the present 

£^ invention. Further provided by the present invention are methods of identifying novel compositions which mediate HDAC8 and the 

^ use of such compositions in diagnosis and treatment of disease. 
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5 HISTONE DEACETYLASE-8 PROTEINS, NUCLEIC ACIDS, AND METHODS OF USE 

FIELD OF THE INVENTION 

The present invention is directed to compositions involved in control of chromatin structure and 
thus, regulation of transcription and methods of use. More particularly, the present invention is 
directed to nucleic acids encoding histone deacetylase proteins involved in transcription J 
1 0 regulation. Methods of use include use in assays screening for modulators of transcription and 
use as therapeutics. 

BACKGROUND OF THE INVENTION 

Modification of chromatin structure via the reversible acetylation of nucleosomal histones plays 
an important role in the regulation of transcription in eukaryotic cells (reviewed in: Grunstein. 

1 5 1 997. Nature 389:349-352). Acetylation of the e-amino group of specific lysine residues within 
the amino terminal tail of core histones results in localized chromatin relaxation. This acetylation 
is required to maintain the unfolded structure of nucleosomes undergoing transcription (Walia et 
al. 1998. J. Biol. Chem. 273:14516-14522). In general, histone acetylation activity correlates 
with transcriptional activation, whereas deacetylase activity correlates with transcriptional 

20 repression. Global changes in gene expression are thought to result from the dynamic 

equilibrium between histone acetyl transferase (HAT) enzyme activity and histone deacetylase 
(HDAC) enzyme activity within the nucleus of a cell. 

HDACs mediate transcriptional repression by interacting with larger multisubunit complexes. 
For example, a specific HDAC known as HDAC1 is known to bind the corepressor Sin3, and 

25 HDAC1-Sin3 further associates with the silencing mediators NCoR and SMRT (Alland, et al. 
1997. Nature 387:49-55). N-CoR/SMRT-HDAC1 is then recruited by specific transcription 
factors bound to promoter elements within the nucleus. For example, the retinoblastoma (Rb) 
gene product recruits N-CoR/SMRT-HDAC1 to bind the transcription factor E2F to repress E2F 
regulated promoters (Luo et al. 1998. Cell 92:463-473; Brehm et al. 1998. Nature. 391:597-600, 

30 Magnaghi-Jaulin et al. 1998. Nature 391:601-605). HDAC1-Sin3 also binds to and mediates 
repression by the MAD/MAX repressor heterodimer (Laherty et al. 1997. Cell 89:349-356). The 
histone deacetylation activity of HDAC1 is essential for this transcriptional repression (Hassig et 
al. 1998. PNAS 95:3519-3524). 
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Several cDNAs encoding histone deacetylases have been characterized. The first to be cloned 
was the yeast protein RPD3 t which was first identified in genetic screens for transcriptional 
repressors (Vidal etal. 1991. Mol. Cell Biol. 11:6317-6327). Mammalian HDAC1 was cloned 
independently as the molecular target of TSA (Taunton et al. 1996. Science 272:408-411). 
5 HDAC1 was observed to be an ortholog of yeast RPD3, and both were shown to have HDAC 
activity in vitro. Three cDNAs, HDAC1, HDAC2, and HDAC3, encoding proteins homologous to 
RPD3 have been described (Yang et al. 1997. J. Biol. Chem. 272:28001-28007; Emiliani et al. 
1998. PNAS 95:2795-2800). All three have ubiquitous tissue distributions, and their activities 
appear to be overlapping. To date, five S. cerevisiae genes have been found which encode 

1 0 HDACs, and the family has been divided into two classes based on sequence conservation and 
proposed biochemical function. S. cerevisiae Hos1 , Hos2 ? Hos3, and RPD3, together with 
mammalian HDAC1, 2, and 3 comprise class 1 . The second class is comprised of the yeast 
HDA1 and HDA3 and the recently identified human homologs HDAC4, HDAC5, and HDAC6 
(Grozinger et al. 1999. PNAS 96:4868-4873; Fischle et al. 1999. J. Biol. Chem. 274:1 1713- 

15 11720). 

As HDACs are known to function as part of multisubunit transcriptional regulatory complexes, 
regulating HDACs may be used to modulate, i.e., increase or decrease, the transcription of a 
specific gene or set of specific genes under the control of a regulatory complex containing an 
HDAC (Carmen et al. 1996. J. Biol. Chem. 271 (26):15837-1 5844. 

20 A variety of compounds possessing HDAC inhibitory activity have been shown to lead to 
hyperacetylation of histones, and this modification is accompanied by cell cycle arrest and 
terminal differentiation in a number of cell types (See Proceedings of the American Association 
for Cancer Research 40, March, 1999). Like NaBu, the fungal toxin trichostatin A (TSA), a 
potent, specific, and reversible inhibitor, has been shown to induce cell cycle arrest and/or 

25 differentiation in a variety of systems. Recently, a synthetic benzamide derivative (MS-27275) 
with HDAC inhibitory activity was described (Saito et al. 1999. PNAS 96:4592-5497). MS-27275 
induces histone hyperacetylation, and demonstrates efficacy in inhibiting tumor growth in tumor 
bearing nude mice. 

In acute promyelocyte leukemia (APL), a genetic rearrangement results in the fusion of the 
30 retinoic acid receptor a (RARa) with either the PML or PLZF proteins. Both PML-RARa and 
PLZF- RARa aberrantly recruit N-CoR/S M RT-H DAC and mediate leukaemogenesis by 
repressing retinoic acid regulated genes (Lin et al. 1998. Nature 391:811-814; Grignani et al. 
1998. Nature 391:815-818). Indeed, the therapeutic use of sodium butyrate (NaBu), a 
nonspecific HDAC inhibitor, to target transcription in APL has been successful even in retinoic 
35 acid resistant forms of the disease (Warrell et al. 1998. J. Natl. Cancer Inst. 90:1621-1625). A 
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role for HDAC has also been demonstrated in acute myeloid leukemias that arise as a result of 
the fusion of the AML1 transcription factor with the ETO ("eight twenty one" or MT68) gene 
product (Gelmetti et al. 1998. Mol. Cell Biol. 18:7185-7191; Wang et al. 1998. PNAS 95:10860- 
10865). Together, this suggests that inhibition of HDAC is a viable approach to the treatment of 
5 certain leukemias. There is further evidence for a role of HDAC in other types of cancers, 
including colon cancer (Hassig et al. 1997. Chem. Biol. 4:783-789; Archer et al. 1998. PNAS 
95:6791-6796), squamous cell carcinoma (Gillenwater et al., 1998. Int. J. Cancer 75:217-224; 
Saunders et al. 1999. Cancer Res. 59:399-404), adenocarcinomas (McBain et al. 1997. 
Biochem. Pharmacol. 53:1357-1368), and neuroblastomas (Swendeman et al. 1999. Proc. 
1 0 Amer. Assoc. Cancer Res. 40, Abst. #3836). HDACs have also been implicated in 

transcriptional regulation mediated by BRCA1 (Yarden et al. 1999. Proc. Ameri. Assoc. Cancer 
Res. 40, Abstr. #3387) and c-fos (Bakin and Curran. 1999. Science 283:387-390). 

On the other hand, it is also sometimes desirable to enhance proliferation of cells in a controlled 
manner. For example, proliferation of cells is useful in wound healing and where growth of 
1 5 tissue is desirable. Thus, identifying modulators which promote, enhance or deter the inhibition 
of proliferation is desirable. 

Despite the desirability of identifying transcription components and modulators, there is a deficit 
in the field of such compounds. Accordingly, it would be advantageous to provide compositions 
and methods useful in screening for modulators of transcription. It would also be advantageous 
20 to provide novel compositions which are involved transcription regulation. 

SUMMARY OF THE INVENTION 
In accordance with the objects outlined above, the present invention provides a nucleic acid that 
encodes an HDAC protein, wherein the polypeptide is designated in the present application as 
"HDAC8". 

25 In one embodiment, the invention provides an isolated nucleic acid molecule encoding HDAC8. 
The present invention also provides for HDAC8 proteins and fragments thereof. 

The invention provides nucleic acid encoding an HDAC protein that hybridizes under high 
stringency conditions to a nucleic acid encoding HDAC8 protein. Accordingly, the invention 
provides for methods of identifying a nucleic acid that encodes and HDAC8 protein or fragments 
30 thereof. 

The present invention further provides expression vectors that encode and express an HDAC8 
protein. 
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In another aspect, the invention provides host cells comprising recombinant nucleic acids and 
expression vectors encoding HDAC8 protein. Accordingly, the invention provides methods of 
producing an HDAC8 protein. 

In a further aspect, the invention provides a recombinant polypeptide comprising an HDAC8 
5 amino acid sequence and methods of identifying a polypeptide comprising an HDAC8 amino 
acid sequence. 

In yet another aspect, the invention provides antibodies that bind to an HDAC8 polypeptide. 

In yet another aspect, the invention provides methods of screening for a bioactive agent that 
binds to an HDAC8 protein. 

10 In still yet another aspect, the invention provides methods of identifying disease states 
associated with aberrant HDAC8 expression. 

Other aspects of the invention will become apparent to the skilled artisan by the following 
description of the invention. 

BRIEF DESCRIPTION OF THE DRAWINGS 

1 5 Figure 1 depicts the complete nucleotide sequence of the cDNA clone, termed HDAC8 whose 
sequence (SEQ ID NO:1) predicts a protein (SEQ ID NO:2) product with homology to the RPD3 
class (I) of HDACs. The translation initiator and terminator codons are underlined. 

Figure 2 depicts the amino acid sequence similarity between HDAC1 (SEQ ID NO:3), HDAC2 
(SEQ ID NO:4), HDAC3 (SEQ ID NO:5), and HDAC8 (SEQ ID NO:6). Identical residues are 
20 boxed in dark gray and conservative substitutions are in light gray. The predicted 377 amino 
acid HDAC8 (approximately 42 kDa) product appears to be missing a region homologous to the 
carboxyl terminal -100 amino acids of HDAC1, 2, and 3. 



DETAILED DESCRIPTION OF THE INVENTION 

The present invention provides histone deacetylase-8 (HDAC8) proteins and nucleic acids 
25 which encode such proteins. Also provided are methods of screening for a bioactive agent 
capable of binding and preferably modulating the activity of the HDAC8 protein. The method 
comprises combining a HDAC8 protein and a candidate bioactive agent and a cell or a 
population of cells, and determining the effect on the cell in the presence and absence of the 
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candidate agent. Other screening assays including binding assays are also provided herein as 
described below. Therapeutics for regulating or modulating the HDAC8 protein are also 
provided and described herein. Diagnostics, as further described below, are also provided 
herein. 

5 A HDAC8 protein of the present invention may be identified in several ways. "Protein" in this 
sense includes proteins, polypeptides, and peptides. The HDAC8 proteins of the invention fall 
into two general classes: proteins that are completely novel, i.e. are not part of a public 
database as of the time of discovery, although they may have homology to either known 
proteins or peptides encoded by expressed sequence tags (ESTs). Alternatively, the HDAC8 

1 0 proteins are known proteins, but that were not known to be involved in chromatin structure 
regulation, histone acetylation, transcription regulation etc; i.e. they are identified herein as 
having a novel biological function. Accordingly, an HDAC8 protein may be initially identified by 
its association with a protein known to be involved in histone acetylation and/or transcription. 
Wherein the HDAC8 proteins and nucleic acids are novel, compositions and methods of use are 

1 5 provided herein. In the case that the HDAC8 proteins and nucleic acids were known but not 
known to be involved in chromatin structure regulation, histone acetylation and/or transcription 
as described herein, methods of use, i.e. functional screens, are provided. 

In one embodiment provided herein, an HDAC8 protein as defined herein has one or more of 
the following characteristics: acetylation of histones; homology to HDAC1 (SEQ ID NO:3), 
20 HDAC2 (SEQ ID NO:4), HDAC3 (SEQ ID NO:5), and HDAC8 (SEQ ID NO:6). The homology is 
found using the following database, algorithm, and parameters. By "homology" herein is meant 
sequence similarity and identity with identity being preferred. 

In one embodiment, HDAC8 nucleic acids or HDAC8 proteins are initially identified by 
substantial nucleic acid and/or amino acid sequence identity or similarity to the sequence(s) 
25 provided herein. In a preferred embodiment, HDAC8 nucleic acids or HDAC8 proteins have 
sequence identity or similarity to the sequences provided herein as described below and one or 
more of the HDAC8 protein bioactivities as further described below. Such sequence identity or 
similarity can be based upon the overall nucleic acid or amino acid sequence. 

In a preferred embodiment, a protein is a "HDAC8 protein" as defined herein if the overall 
30 sequence identity of the amino acid sequence of Figure 1 is preferably greater than about 60%, 
more preferably greater than about 70%, even more preferably greater than about 80% and 
most preferably greater than 90%. In some embodiments the sequence identity will be as high 
as about 93 to 95 or 98%. 

As is known in the art, a number of different programs can be used to identify whether a protein 
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(or nucleic acid as discussed below) has sequence identity or similarity to a known sequence. 
Sequence identity and/or similarity is determined using standard techniques known in the art, . 
including, but not limited to, the local sequence identity algorithm of Smith & Waterman, Adv. 
Appl. Math. 2:482 (1981), by the sequence identity alignment algorithm of Needleman & 
5 Wunsch, J. Mol. Biol. 48:443 (1 970), by the search for similarity method of Pearson & Lipman, 
PNAS USA 85:2444 (1988), by computerized implementations of these algorithms (GAP, 
BESTFIT, FASTA, and TFASTA in the Wisconsin Genetics Software Package, Genetics 
Computer Group, 575 Science Drive, Madison, Wl), the Best Fit sequence program described 
by Devereux et a/., Nucl. Acid Res. 72:387-395 (1984), preferably using the default settings, or 
1 0 by inspection. Preferably, percent identity is calculated by FastDB based upon the following 
parameters: mismatch penalty of 1 ; gap penalty of 1; gap size penalty of 0.33; and joining 
penalty of 30, "Current Methods in Sequence Comparison and Analysis," Macromolecule 
Sequencing and Synthesis, Selected Methods and Applications, pp 127-149 (1988), Alan R. 
Liss, Inc. 

1 5 An example of a useful algorithm is PILEUP. PILEUP creates a multiple sequence alignment 
from a group of related sequences using progressive, pairwise alignments. It can also plot a 
tree showing the clustering relationships used to create the alignment. PILEUP uses a 
simplification of the progressive alignment method of Feng & Doolittle, J. Mol. Evol. 35:351-360 
(1987); the method is similar to that described by Higgins & Sharp CABIOS 5:151-153 (1989). 

20 Useful PILEUP parameters including a default gap weight of 3.00, a default gap length weight of 
0.1 0, and weighted end gaps. 

Another example of a useful algorithm is the BLAST algorithm, described in Altschul et al., J. 
Mol. Biol. 215, 403-410, (1990) and Karlin et al., PNAS USA 90:5873-5787 (1993). A 
particularly useful BLAST program is the WU-BLAST-2 program which was obtained from 

25 Altschul et al., Methods in Enzvmoloav . 266 : 460-480 (1 996); http://blast.wustl/edu/blast/ 
README.html]. WU-BLAST-2 uses several search parameters, most of which are set to the 
default values. The adjustable parameters are set with the following values: overlap span =1 , 
overlap fraction = 0.125, word threshold (T) = 1 1 . The HSP S and HSP S2 parameters are 
dynamic values and are established by the program itself depending upon the composition of 

30 the particular sequence and composition of the particular database against which the sequence 
of interest is being searched; however, the values may be adjusted to increase sensitivity. 

An additional useful algorithm is gapped BLAST as reported by Altschul et al. Nucleic Acids 
Res. 25:3389-3402. Gapped BLAST uses BLOSUM-62 substitution scores; threshold T 
parameter set to 9; the two-hit method to trigger ungapped extensions; charges gap lengths of k 
35 a cost of 1 0+/r, X u set to 1 6, and X g set to 40 for database search stage and to 67 for the output 
stage of the algorithms. Gapped alignments are triggered by a score corresponding to -22 bits. 
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A % amino acid sequence identity value is determined by the number of matching identical 
residues divided by the total number of residues of the "longer" sequence in the aligned region. 
The "longer 11 sequence is the one having the most actual residues in the aligned region (gaps 
introduced by WU-Blast-2 to maximize the alignment score are ignored). 

5 In a similar manner, "percent (%) nucleic acid sequence identity" with respect to the coding 
sequence of the polypeptides identified herein is defined as the percentage of nucleotide 
residues in a candidate sequence that are identical with the nucleotide residues in the coding 
sequence of the HDAC8 protein. A preferred method utilizes the BLASTN module of WU- 
BLAST-2 set to the default parameters, with overlap span and overlap fraction set to 1 and 
10 0.125, respectively. 

The alignment may include the introduction of gaps in the sequences to be aligned. In addition, 
for sequences which contain either more or fewer amino acids than the protein encoded by the 
sequences in the figures, it is understood that in one embodiment, the percentage of sequence 
identity will be determined based on the number of identical amino acids in relation to the total 
1 5 number of amino acids. Thus, for example, sequence identity of sequences shorter than that 
shown in Figure 1 , as discussed below, will be determined using the number of amino acids in 
the shorter sequence, in one embodiment. In percent identity calculations relative weight is not 
assigned to various manifestations of sequence variation, such as, insertions, deletions, 
substitutions, etc. 

20 In one embodiment, only identities are scored positively (+1) and all forms of sequence variation 
including gaps are assigned a value of "0", which obviates the need for a weighted scale or 
parameters as described herein for sequence similarity calculations. Percent sequence identity 
can be calculated, for example, by dividing the number of matching identical residues by the 
total number of residues of the "shorter" sequence in the aligned region and multiplying by 100. 

25 The "longer" sequence is the one having the most actual residues in the aligned region. 

As will be appreciated by those skilled in the art, the sequences of the present invention may 
contain sequencing errors. That is, there may be incorrect nucleosides, frameshifts, unknown 
nucleosides, or other types of sequencing errors in any of the sequences; however, the correct 
sequences will fall within the homology and stringency definitions herein. 

30 HDAC8 proteins of the present invention may be shorter or longer than the amino acid 

sequence encoded by the nucleic acid shown in Figure 1. Thus, in a preferred embodiment, 
included within the definition of HDAC8 proteins are portions or fragments of the amino acid 
sequence encoded by the nucleic acid sequence provided herein. In one embodiment herein, 
fragments of HDAC8 proteins are considered HDAC8 proteins if a) they share at least one 
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antigenic epitope; b) have at least the indicated sequence identity; and c) preferably have 
histone deacetylase-8 biological activity as further defined herein. In some cases, where the 
sequence is used diagnostically, that is, when the presence or absence of a HDAC8 protein 
nucleic acid is determined, only the indicated sequence identity is required. The nucleic acids 
5 of the present invention may also be shorter or longer than the sequence in Figure 1 . The 
nucleic acid fragments include any portion of the nucleic acids provided herein which have a 
sequence not exactly previously identified; fragments having sequences with the indicated 
sequence identity to that portion not previously identified are provided in an embodiment herein. 

In addition, as is more fully outlined below, HDAC8 proteins can be made that are longer than 
1 0 those depicted in Figure 1 ; for example, by the addition of epitope or purification tags, the 
addition of other fusion sequences, or the elucidation of additional coding and non-coding 
sequences. As described below, the fusion of a HDAC8 peptide to a fluorescent peptide, such 
as Green Fluorescent Peptide (GFP), is particularly preferred. 

HDAC8 proteins may also be identified as encoded by HDAC8 nucleic acids which hybridize to 
15 the sequence depicted in the Figure 1 , or the complement thereof, as outlined herein. 
Hybridization conditions are further described below. 

In a preferred embodiment, when a HDAC8 protein is to be used to generate antibodies, a 
HDAC8 must share at least one epitope or determinant with the full length protein. By "epitope" 
or "determinant" herein is meant a portion of a protein which will generate and/or bind an 

20 antibody. Thus, in most instances, antibodies made to a smaller HDAC8 protein will be able to 
bind to the full length protein. In a preferred embodiment, the epitope is unique; that is, 
antibodies generated to a unique epitope show little or no cross-reactivity. The term "antibody" 
includes antibody fragments, as are known in the art, including Fab, Fab 2 , single chain 
antibodies (Fv for example), chimeric antibodies, etc., either produced by the modification of 

25 whole antibodies or those synthesized de novo using recombinant DNA technologies. 

In a preferred embodiment, the antibodies to a HDAC8 protein upon binding to an HDAC8 
protein reduce or eliminate at least one biological function of the HDAC8 protein as described 
herein. That is, the addition of anti-HDAC8 protein antibodies (either polyclonal or preferably 
monoclonal) to HDAC8 proteins (or cells containing HDAC8 proteins) may reduce or eliminate 
30 an HDAC8 activity. Generally, at least a 25% decrease in activity is preferred, with at least 
about 50% being particularly preferred and about a 95-100% decrease being especially 
preferred. 

The HDAC8 antibodies of the invention specifically bind to HDAC8 proteins. In a preferred 
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embodiment, the antibodies specifically bind to HDAC8 proteins. By "specifically bind" herein is 
meant that the antibodies bind to the protein with a binding constant in the range of at least 10* 4 - 
10* 6 M*\ with a preferred range being 10" 7 - 10' 9 M"\ Antibodies are further described below. 

in the case of the nucleic acid, the overall sequence identity of the nucleic acid sequence is 
5 commensurate with amino acid sequence identity but takes into account the degeneracy in the 
genetic code and codon bias of different organisms. Accordingly, the nucleic acid sequence 
identity may be either lower or higher than that of the protein sequence. Thus the sequence 
identity of the nucleic acid sequence as compared to the nucleic acid sequence of the Figure is 
preferably greater than 75%, more preferably greater than about 80%, particularly greater than 
1 0 about 85% and most preferably greater than 90%. In some embodiments the sequence identity 
will be as high as about 93 to 95 or 98%. 

In a preferred embodiment, a HDAC8 nucleic acid encodes a HDAC8 protein. As will be 
appreciated by those in the art, due to the degeneracy of the genetic code, an extremely large 
number of nucleic acids may be made, all of which encode the HDAC8 proteins of the present 
1 5 invention. Thus, having identified a particular amino acid sequence, those skilled in the art 
could make any number of different nucleic acids, by simply modifying the sequence of one or 
more codons in a way which does not change the amino acid sequence of the HDAC8 protein. 

In one embodiment, the nucleic acid is determined through hybridization studies. Thus, for 
example, nucleic acids which hybridize under high stringency to the nucleic acid sequence 

20 shown in Figure 1 , or its complement is considered an HDAC8 nucleic acid. High stringency 
conditions are known in the art; see for example Maniatis et a!., Molecular Cloning: A 
Laboratory Manual, 2d Edition, 1989, and Short Protocols in Molecular Biology, ed. Ausubel, et 
al., both of which are hereby incorporated by reference. Stringent conditions are sequence- 
dependent and will be different in different circumstances. Longer sequences specifically 

25 hybridize at higher temperatures. An extensive guide to the hybridization of nucleic acids is 
found in Tijssen, Techniques in Biochemistry and Molecular Biology-Hybridization with Nucleic 
Acid Probes, "Overview of principles of hybridization and the strategy of nucleic acid assays" 
(1993). Generally, stringent conditions are selected to be about 5-1 0°C lower than the thermal 
melting point (TJ for the specific sequence at a defined ionic strength pH. The T m is the 

30 temperature (under defined ionic strength, pH, and nucleic acid concentration) at which 50% of 
the probes complementary to the target hybridize to the target sequence at equilibrium (as the 
target sequences are present in excess, at T m , 50% of the probes are occupied at equilibrium). 
Stringent conditions will be those in which the salt concentration is less than about 1 .0 M 
sodium ion, typically about 0.01 to 1 .0 M sodium ion concentration (or other salts) at about pH 

35 7.0 to 8.3 and the temperature is at least about 30°C for short probes (e.g. about 1 0 to 50 
nucleotides) and at least about 60°C for long probes (e.g. greater than about 50 nucleotides). 
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Stringent conditions may also be achieved with the addition of destabilizing agents such as 
formamide. 

In another embodiment, less stringent hybridization conditions are used; for example, moderate 
or low stringency conditions may be used, as are known in the art; see Maniatis and Ausubel, 
5 supra, and Tijssen, supra. 

The HDAC8 proteins and nucleic acids of the present invention are preferably recombinant. As 
used herein and further defined below, "nucleic acid" may refer to either DNA or RNA, or 
molecules which contain both deoxy- and ribonucleotides. The nucleic acids include genomic 
DNA, cDNA and oligonucleotides including sense and anti-sense nucleic acids. Such nucleic 
1 0 acids may also contain modifications in the ribose-phosphate backbone to increase stability and 
half life of such molecules in physiological environments. 

The nucleic acid may be double stranded, single stranded, or contain portions of both double 
stranded or single stranded sequence. As will be appreciated by those in the art, the depiction 
of a single strand ("Watson") also defines the sequence of the other strand ("Crick"); thus the 

1 5 sequences depicted in the Figures also include the complement of the sequence. By the term 
"recombinant nucleic acid" herein is meant nucleic acid, originally formed in vitro, in general, by 
the manipulation of nucleic acid by endonucleases and/or polymerases and/or ligases, in a form 
not normally found in nature. Thus an isolated HDAC8 nucleic acid, in a linear form, or an 
expression vector formed in vitro by ligating DNA molecules that are not normally joined, are 

20 both considered recombinant for the purposes of this invention. It is understood that once a 
recombinant nucleic acid is made and reintroduced into a host cell or organism, it will replicate 
non-recombinantly, i.e. using the in vivo cellular machinery of the host cell rather than in vitro 
manipulations; however, such nucleic acids, once produced recombinantly, although 
subsequently replicated non-recombinantJy, are still considered recombinant for the purposes of 

25 the invention. 

Similarly, a "recombinant protein" is a protein made using recombinant techniques, i.e. through 
the expression of a recombinant nucleic acid as depicted above. A recombinant protein is 
distinguished from naturally occurring protein by at least one or more characteristics. For 
example, the protein may be isolated or purified away from some or all of the proteins and 

30 compounds with which it is normally associated in its wild type host, and thus may be 

substantially pure. For example, an isolated protein is unaccompanied by at least some of the 
material with which it is normally associated in its natural state, preferably constituting at least 
about 0.5%, more preferably at least about 5% by weight of the total protein in a given sample. 
A substantially pure protein comprises at least about 75% by weight of the total protein, with at 

35 least about 80% being preferred, and at least about 90% being particularly preferred. The 
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definition includes the production of an HDAC8 protein from one organism in a different 
organism or host cell. Alternatively, the protein may be made at a significantly higher 
concentration than is normally seen, through the use of an inducible promoter or high 
expression promoter, such that the protein is made at increased concentration levels. 
5 Alternatively, the protein may be in a form not normally found in nature, as in the addition of an 
epitope tag or amino acid substitutions, insertions and/or deletions, as discussed below. 

In one embodiment, the present invention provides HDAC8 protein variants. These variants fall 
into one or more of three classes: substitutional, insertional or deletional variants. These 
variants ordinarily are prepared by site specific mutagenesis of nucleotides in the DNA 

1 0 encoding a HDAC8 protein, using cassette or PCR mutagenesis, gene shuffling or other 
techniques well known in the art, to produce DNA encoding the variant, and thereafter 
expressing the DNA in recombinant cell culture as outlined above. However, variant HDAC8 
protein fragments having up to about 100-150 residues may be prepared by in vitro synthesis 
using established techniques. Amino acid sequence variants are characterized by the 

1 5 predetermined nature of the variation, a feature that sets them apart from naturally occurring 
allelic or interspecies variation of the HDAC8 protein amino acid sequence. The variants 
typically exhibit the same qualitative biological activity as the naturally occurring analogue, 
although variants can also be selected which have modified characteristics as will be more fully 
outlined below. 

20 While the site or region for introducing an amino acid sequence variation is predetermined, the 
mutation per se need not be predetermined. For example, in order to optimize the performance 
of a mutation at a given site, random mutagenesis may be conducted at the target codon or 
region and the expressed HDAC8 variants screened for the optimal combination of desired 
activity. Techniques for making substitution mutations at predetermined sites in DNA having a 

25 known sequence are well known and include , for example, M13 primer mutagenesis and PCR 
mutagenesis. Screening of the mutants is done using assays of HDAC8 protein activities 
and/or properties. 

Amino acid substitutions are typically of single residues; insertions usually will be on the order of 
from about 1 to 20 amino acids, although considerably larger insertions may be tolerated. 
30 Deletions range from about 1 to about 20 residues, although in some cases deletions may be 
much larger. 

Substitutions, deletions, insertions or any combination thereof may be used to arrive at a final 
derivative. Generally these changes are done on a few amino acids to minimize the alteration 
of the molecule. However, larger changes may be tolerated in certain circumstances. When 
35 small alterations in the characteristics of the HDAC8 protein are desired, substitutions are 
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generally made in accordance with the following chart: 

Chart I 

Original Residue Exemplary Substitutions 



Ala 

Ala 


ber 


Arg 


Lys 


Asn 


Gin, His 


Asp 


GIU 


Cys 


Ser 


Gin 


Asn 


Glu 


Asp 


Gly 


Pro 


MIS 


Ann r>U 

Asn, Gin 


lie 


Leu, Val 


Leu 


lie, Val 


Lys 


Arg, Gin, Glu 


Met 


Leu, lie 


Phe 


Met, Leu, Tyr 


Ser 


Thr 


Thr 


Ser 


Trp 


Tyr 


Tyr 


Trp, Phe 


Val 


He, Leu 



Substantial changes in function or immunological identity are made by selecting substitutions 
that are less conservative than those shown in Chart I. For example, substitutions may be 

25 made which more significantly affect: the structure of the polypeptide backbone in the area of 
the alteration, for example the alpha-helical or beta-sheet structure; the charge or 
hydrophobicity of the molecule at the target site; or the bulk of the side chain. The substitutions 
which in general are expected to produce the greatest changes in the polypeptide's properties 
are those in which (a) a hydrophilic residue, e.g. seryl or threonyl, is substituted for (or by) a 

30 hydrophobic residue, e.g. leucyl, isoleucyl, phenylalanyl, valyl or alanyl; (b) a cysteine or proline 
is substituted for (or by) any other residue; (c) a residue having an electropositive side chain, 
e.g. lysyl, arginyl, or histidyl, is substituted for (or by) an electronegative residue, e.g. glutamyl 
or aspartyl; or (d) a residue having a bulky side chain, e.g. phenylalanine, is substituted for (or 
by) one not having a side chain, e.g. glycine. 



35 The variants typically exhibit the same qualitative biological activity and will elicit the same 
immune response as the naturally-occurring analogue, although variants also are selected to 
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modify the characteristics of the HDAC8 proteins as needed. Alternatively, the variant may be 
designed such that the biological activity of the HDAC8 protein is altered. For example, 
glycosylation sites may be added, altered or removed. HDAC8 variants may be designed to 
add phosphorylation sites or alter or delete phosphorylation sites at, for example, amino acid 
5 positions 36-39 wherein the cAMP mediated phosphorylation consensus sequence "KRAS" is 
found. The tyrosine phosphorylation consensus sequence is found at Tyr-174 , which also may 
be altered. 

Covalent modifications of HDAC8 polypeptides are included within the scope of this invention. 
One type of covalent modification includes reacting targeted amino acid residues of a HDAC8 

1 0 polypeptide with an organic derivatizing agent that is capable of reacting with selected side 
chains or the N-or C-terminal residues of a HDAC8 polypeptide. Derivatization with bifunctional 
agents is useful, for instance, for crosslinking HDAC8 protein to a water-insoluble support 
matrix or surface for use in the method of purifying anti-HDAC8 antibodies or screening assays, 
as is more fully described below. Commonly used crosslinking agents include, e.g., 1,1- 

1 5 bis(diazoacetyl)-2-phenylethane, glutaraldehyde, N-hydroxysuccinimide esters, for example, 
esters with 4-azidosaiicylic acid, homobifunctional imidoesters, including disuccinimidyl esters 
such as 3 f 3'-dithiobis(succinimidylpropionate), bifunctional maleimides such as bis-N- 
maleimido-1 ,8-octane and agents such as methyl-3-[(p-azidophenyl)dithio]propioimidate. 

20 Other modifications include deamidation of glutaminyl and asparaginyt residues to the 

corresponding glutamyl and aspartyl residues, respectively, hydroxylation of proline and lysine, 
phosphorylation of hydroxyl groups of seryl or threonyl residues, methylation of the "amino 
groups of lysine, arginine, and histidine side chains [T.E. Creighton, Proteins: Structure and 
Molecular Properties . W.H. Freeman & Co., San Francisco, pp. 79-86 (1983)], acetylation of the 

25 N-terminal amine, and amidation of any C-terminal carboxyl group. 

r 

Another type of covalent modification of the HDAC8 polypeptide included within the scope of 
this invention comprises altering the native glycosylation pattern of the polypeptide. "Altering 
the native glycosylation pattern" is intended for purposes herein to mean deleting one or more 
30 carbohydrate moieties found in native sequence HDAC8 polypeptide, and/or adding one or 
more glycosylation sites that are not present in the native sequence HDAC8 polypeptide. 

Addition of glycosylation sites to HDAC8 polypeptides may be accomplished by altering the 
amino acid sequence thereof. The alteration may be made, for example, by the addition of, or 
substitution by, one or more serine or threonine residues to the native sequence HDAC8 
35 polypeptide (for O-linked glycosylation sites). The alteration also may be made, for example, by 
the addition of, or substitution by one or more Axn-Xaa-Ser/Thr sites (Xaa = any amino acid) in 
the native sequence HDAC8 polypeptide (for N-linked glycosylation sites). The HDAC8 amino 
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acid sequence may optionally be altered through changes at the DNA level, particularly by 
mutating the DNA encoding the HDAC8 polypeptide at preselected bases such that codons are 
generated that will translate into the desired amino acids. 

Another means of increasing the number of carbohydrate moieties on the HDAC8 polypeptide is 
5 by chemical or enzymatic coupling of glycosides to the polypeptide. Such methods are 
described in the art, e.g., in WO 87/05330 published 11 September 1987, and in Aplin and 
Wriston, CRC Crit. Rev. Biochem. . pp. 259-306 (1981). 

Removal of carbohydrate moieties present on the HDAC8 polypeptide may be accomplished 
chemically or enzymatically or by mutational substitution of codons encoding for amino acid 
1 0 residues that serve as targets for glycosylation. Chemical deglycosylation techniques are 
known in the art and described, for instance, by Hakimuddin, et al., Arch. Biochem. Biophvs. . 
259:52 (1987) and by Edge et al., Anal. Biochem. . 118:131 (1981). Enzymatic cleavage of 
carbohydrate moieties on polypeptides can be achieved by the use of a variety of endo-and 
exo-glycosidases as described by Thotakura et al., Meth. EnzvmoL 138:350 (1987). 

1 5 Another type of covalent modification of HDAC8 polypeptide comprises linking the HDAC8 
polypeptide to one of a variety of nonproteinaceous polymers, e.g., polyethylene glycol, 
polypropylene glycol, or polyoxyalkylenes, in the manner set forth in U.S. Patent Nos. 
4,640,835; 4,496,689; 4,301,144; 4,670,417; 4,791,192 or 4,179,337. 

HDAC8 polypeptides of the present invention may also be modified in a way to form chimeric 
20 molecules comprising an HDAC8 polypeptide fused to another, heterologous polypeptide or 
amino acid sequence. In one embodiment, such a chimeric molecule comprises a fusion of a 
HDACjB polypeptide with a tag polypeptide which provides an epitope to which an anti-tag 
antibody can selectively bind. The epitope tag is generally placed at the amino-or carboxyl- 
terminus of the HDAC8 polypeptide but may be incorporated as an internal insertion or 
25 substitution. The presence of such epitope-tagged forms of a HDAC8 polypeptide can be 
detected using an antibody against the tag polypeptide. Also, provision of the epitope tag 
enables the HDAC8 polypeptide to be readily purified by affinity purification using an anti-tag 
antibody or another type of affinity matrix that binds to the epitope tag. In an alternative 
embodiment, the chimeric molecule may comprise a fusion of a HDAC8 polypeptide with an 
30 immunoglobulin or a particular region of an immunoglobulin. For a bivalent form of the chimeric 
molecule, such a fusion could be to the Fc region of an IgG molecule as discussed further 
below. 

Various tag polypeptides and their respective antibodies are well known in the art. Examples 
include poly-histidine (poly-his) or poly-histidine-glycine (poly-his-gly) tags; the flu HA tag 
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polypeptide and its antibody 12CA5 [Field et al., Mol. Cell. Biol. . 8:2159-2165 (1988)]; the c-myc 
tag and the 8F9, 3C7, 6E10, G4 f B7 and 9E10 antibodies thereto [Evan et al., Molecular and 
Cellular Biology . 5:3610-3616 (1985)]; and the Herpes Simplex virus glycoprotein D (gD) tag 
and its antibody [Paborsky et al., Protein Engineering . 3(6):547-553 (1990)]. Other tag 
5 polypeptides include the Flag-peptide [Hopp et al., BioTechnoloav . 6:1204-1210 (1988)]; the 
KT3 epitope peptide [Martin et al., Science . 255:192-194 (1992)]; tubulin epitope peptide 
[Skinner et al., J. Biol. Chem. . 266:15163-15166 (1991)]; and the T7 gene 10 protein peptide 
tag [Lutz-Freyermuth et al., Proc. Natl. Acad. Sci. USA . 87:6393-6397 (1990)] and the histidine 
tag and metal binding sites (Smith, Ann. NY. Acad. Sci., 646:315-321 (1991)], with the Flag and 
1 0 histidine tag being preferred. 

In an embodiment herein, HDAC8 proteins of the HDAC family and HDAC proteins from other 
organisms are cloned and expressed as outlined below. Thus, probe or degenerate 
polymerase chain reaction (PCR) primer sequences may be used to find other related HDAC8 
proteins from humans or other organisms. As will be appreciated by those in the art, 

1 5 particularly useful probe and/or PCR primer sequences include the unique areas of the HDAC8 
nucleic acid sequence. As is generally known in the art, preferred PCR primers are from about 
15 to about 35 nucleotides in length, with from about 20 to about 30 being preferred, and may 
contain inosine as needed. The conditions for the PCR reaction are well known in the art. It is 
therefore also understood that provided along with the sequences in the sequences listed 

20 herein are portions of those sequences, wherein unique portions of 15 nucleotides or more are 
particularly preferred. The skilled artisan can routinely synthesize or cut a nucleotide sequence 
to the desired length. 

Once isolated from its natural source, e.g., contained within a plasmid or other vector or excised 
therefrom as a linear nucleic acid segment, the recombinant HDAC8 nucleic acid can be 
25 further-used as a probe to identify and isolate other HDAC8 nucleic acids. It can also be used 
as a "precursor" nucleic acid to make modified or variant HDAC8 nucleic acids and proteins. 

Using the nucleic acids of the present invention which encode an HDAC8 protein, a variety of 
expression vectors are made. The expression vectors may be either self-replicating 
extrachromosomal vectors or vectors which integrate into a host genome. Generally, these 

30 expression vectors include transcriptional and translational regulatory nucleic acid operably 
linked to the nucleic acid encoding the HDAC8 protein. The term "control sequences" refers to 
nucleic acid sequences necessary for the expression of an operably linked coding sequence in 
a particular host organism. The control sequences that are suitable for prokaryotes, for 
example, include a promoter, optionally an operator sequence, and a ribosome binding site. 

35 Eukaryotic cells are known to utilize promoters, polyadenylation signals, and enhancers. 
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Nucleic acid is "operably linked" when it is placed into a functional relationship with another 
nucleic acid sequence. For example, DNA for a presequence or secretory leader is operably 
linked to DNA for a polypeptide if it is expressed as a preprotein that participates in the 
secretion of the polypeptide; a promoter or enhancer is operably linked to a coding sequence if 
5 it affects the transcription of the sequence; or a ribosome binding site is operably linked to a 
coding sequence if it is positioned so as to facilitate translation. As another example, operably 
linked refers to DNA sequences linked so as to be contiguous, and, in the case of a secretory 
leader, contiguous and in reading phase. However, enhancers do not have to be contiguous. 
Linking is accomplished by ligation at convenient restriction sites. If such sites do not exist, the 

1 0 synthetic oligonucleotide adaptors or linkers are used in accordance with conventional practice. 
The transcriptional and translational regulatory nucleic acid will generally be appropriate to the 
host cell used to express the HDAC8 protein; for example, transcriptional and translational 
regulatory nucleic acid sequences from Bacillus are preferably used to express the HDAC8 
protein in Bacillus. Numerous types of appropriate expression vectors, and suitable regulatory 

1 5 sequences are known in the art for a variety of host cells. 

In general, the transcriptional and translational regulatory sequences may include, but are not 
limited to, promoter sequences, ribosomal binding sites, transcriptional start and stop 
sequences, translational start and stop sequences, and enhancer or activator sequences. In a 
preferred embodiment, the regulatory sequences include a promoter and transcriptional start 
20 and stop sequences. 

Promoter sequences encode either constitutive or inducible promoters. The promoters may be 
either naturally occurring promoters or hybrid promoters. Hybrid promoters, which combine 
elements of more than one promoter, are also known in the art, and are useful in the present 
invention. 

25 In addition, the expression vector may comprise additional elements. For example, the 
expression vector may have two replication systems, thus allowing it to be maintained in two 
organisms, for example in mammalian or insect cells for expression and in a procaryotic host for 
cloning and amplification. Furthermore, for integrating expression vectors, the expression 
vector contains at least one sequence homologous to the host cell genome, and preferably two 

30 homologous sequences which flank the expression construct. The integrating vector may be 
directed to a specific locus in the host cejl by selecting the appropriate homologous sequence 
for inclusion in the vector. Constructs for integrating vectors are well known in the art. 

In addition, in a preferred embodiment, the expression vector contains a selectable marker 
gene to allow the selection of transformed host cells. Selection genes are well known in the art 
35 and will vary with the host cell used. 
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A preferred expression vector system is a retroviral vector system such as is generally 
described in PCT/US97/01019 and PCT/US97/01048, both of which are hereby expressly 
incorporated by reference. 

HDAC8 proteins of the present invention are produced by culturing a host cell transformed with 
5 an expression vector containing nucleic acid encoding a HDAC8 protein, under the appropriate 
conditions to induce or cause expression of the HDAC8 protein. The conditions appropriate for 
HDAC8 protein expression will vary with the choice of the expression vector and the host cell, 
and will be easily ascertained by one skilled in the art through routine experimentation. For 
example, the use of constitutive promoters in the expression vector will require optimizing the 
1 0 growth and proliferation of the host cell, while the use of an inducible promoter requires the 
appropriate growth conditions for induction. In addition, in some embodiments, the timing of the 
harvest is important. For example, the baculoviral systems used in insect cell expression are 
lytic viruses, and thus harvest time selection can be crucial for product yield. 

Appropriate host cells include yeast, bacteria, archebacteria, fungi, and insect and animal cells, 
1 5 including mammalian cells. Of particular interest are Drosophila melangaster cells, 

Saccharomyces cerevisiae and other yeasts, E. co//, Bacillus subtiiis, SF9 cells, C129 cells, 293 
cells, Neurospora, BHK, CHO, COS, and HeLa cells, fibroblasts, Schwanoma cell lines, 
immortalized mammalian myeloid and lymphoid cell lines, with HeLa and SF9 cells being 
preferred. 

20 In a preferred embodiment, the HDAC8 proteins are expressed in mammalian cells. 

Mammalian expression systems are also known in the art, and include retroviral systems. A 
mammalian promoter is any DNA sequence capable of binding mammalian RNA polymerase 
and initiating the downstream (3*) transcription of a coding sequence for HDAC8 protein into 
mRNA. A promoter will have a transcription initiating region, which is usually placed proximal to 

25 the 5' end of the coding sequence, and a TATA box, usually located about 25-30 base pairs 
upstream of the transcription initiation site. The TATA box is thought to direct RNA polymerase 
II to begin RNA synthesis at the correct site. A mammalian promoter will also contain an 
upstream promoter element (enhancer element), typically located within 100 to 200 base pairs 
upstream of the TATA box. An upstream promoter element determines the rate at which 

30 transcription is initiated and can act in either orientation. Of particular use as mammalian 

promoters are the promoters from mammalian viral genes, since the viral genes are often highly 
expressed and have a broad host range. Examples include the SV40 early promoter, mouse 
mammary tumor virus LTR promoter, adenovirus major late promoter, herpes simplex virus 
promoter, and the CMV promoter. 

35 Typically, transcription termination and polyadenylation sequences recognized by mammalian 
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cells are regulatory regions located 3* to the translation stop codon and thus, together with the 
promoter elements, flank the coding sequence. The 3* terminus of the mature mRNA is formed 
by site-specific post-translational cleavage and polyadenylation. Examples of transcription 
terminator and polyadenlytion signals include those derived form SV40. 

5 The methods of introducing exogenous nucleic acid into mammalian hosts, as well as other 
hosts, is well known in the art, and will vary with the host cell used. Techniques include 
dextran-mediated transfection, calcium phosphate precipitation, polybrene mediated 
transfection, protoplast fusion, electroporation, viral infection, encapsulation of the 
polynucleotide(s) in liposomes, and direct microinjection of the DNA into nuclei. 

10 In a preferred embodiment, HDAC8 proteins are expressed in bacterial systems. Bacterial 
expression systems are well known in the art. 

A suitable bacterial promoter is any nucleic acid sequence capable of binding bacterial RNA 
polymerase and initiating the downstream (3') transcription of the coding sequence of HDAC8 
protein into mRNA. A bacterial promoter has a transcription initiation region which is usually 

1 5 placed proximal to the 5' end of the coding sequence. This transcription initiation region 

typically includes an RNA polymerase binding site and a transcription initiation site. Sequences 
encoding metabolic pathway enzymes provide particularly useful promoter sequences. 
Examples include promoter sequences derived from sugar metabolizing enzymes, such as 
galactose, lactose and maltose, and sequences derived from biosynthetic enzymes such as 

20 tryptophan. Promoters from bacteriophage may also be used and are known in the art. In 
addition, synthetic promoters and hybrid promoters are also useful; for example, the tac 
promoter is a hybrid of the trp and lac promoter sequences. Furthermore, a bacterial promoter 
can include naturally occurring promoters of non-bacterial origin that have the ability to bind 
bacterial RNA polymerase and initiate transcription. 

25 

In addition to a functioning promoter sequence, an efficient ribosome binding site is desirable. 
In E. co//, the ribosome binding site is called the Shine-Delgarno (SD) sequence and includes 
an initiation codon and a sequence about 3-9 nucleotides in length located about 3-1 1 
nucleotides upstream of the initiation codon. 

30 The expression vector may also include a signal peptide sequence that provides for secretion of 
the HDAC8 protein in bacteria. The signal sequence typically encodes a signal peptide 
comprised of hydrophobic amino acids which direct the secretion of the protein from the cell, as 
is well known in the art. The protein is either secreted into the growth media (gram-positive 
bacteria) or into the periplasmic space, located between the inner and outer membrane of the 

35 cell (gram-negative bacteria). 
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The bacterial expression vector may also Include a selectable marker gene to allow for the 
selection of bacterial strains that have been transformed. Suitable selection genes include 
genes which render the bacteria resistant to drugs such as ampicillin, chloramphenicol, 
erythromycin, kanamycin, neomycin and tetracycline. Selectable markers also include 
5 biosynthetic genes, such as those in the histidine, tryptophan and leucine biosynthetic 
pathways. 

These components are assembled into expression vectors. Expression vectors for bacteria are 
well known in the art, and include vectors for Bacillus subtilis, E. coli, Streptococcus cremoris, 
and Streptococcus lividans, among others. 

1 0 The bacterial expression vectors are transformed into bacterial host cells using techniques well 
known in the art, such as calcium chloride treatment, electroporation, and others. 

In one embodiment, HDAC8 proteins are produced in insect cells. Expression vectors for the 
transformation of insect cells, and in particular, baculovirus-based expression vectors, are well 
known in the art. 

15 In a preferred embodiment, HDAC8 protein is produced in yeast cells. Yeast expression 
systems are well known in the art, and include expression vectors for Saccharomyces 
cerevlsiae t Candida albicans and C. maltosa, Hansenula polymorpha t Kluyveromyces fragilis 
and K. Iactis t Pichia guillerimondii and P. pastoris, Schizosaccharomyces pombe, and Yarrowia 
lipolytica. Preferred promoter sequences for expression in yeast include the inducible GAL1 ,10 

20 promoter, the promoters from alcohol dehydrogenase, enolase, glucokinase, glucose-6- 
phosphate isomerase, glyceraldehyde-3-phosphate-dehydrogenase, hexokinase, 
phosphofructokinase, 3-phosphoglycerate mutase, pyruvate kinase, and acid phosphatase 
genes. Yeast selectable markers include ADE2, HIS4, LEU2, TRP1, and ALG7, which confers 
resistance to tunicamycin; the neomycin phosphotransferase gene, which confers resistance to 

25 G418; and the CUP1 gene, which allows yeast to grow in the presence of copper ions. 

The HDAC8 protein may also be made as a fusion protein, using techniques well known in the 
art. Thus, for example, for the creation of monoclonal antibodies, if the desired epitope is small, 
the HDAC8 protein may be fused to a carrier protein to form an immunogen. Alternatively, the 
HDAC8 may be made as a fusion protein to increase expression, or for other reasons. For 
30 example, when the HDAC8 protein is an HDAC8 peptide, the nucleic acid encoding the peptide 
may be linked to another nucleic acid for expression purposes. Similarly, HDAC8 proteins of 
the invention can be linked to protein labels, such as green fluorescent protein (GFP), red 
fluorescent protein (RFP), blue fluorescent protein (BFP), yellow fluorescent protein (YFP), etc. 
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In one embodiment, the HDAC8 nucleic acids, proteins and antibodies of the invention are 
labeled. By "labeled" herein is meant that a compound has at least one element, isotope or 
chemical compound attached to enable the detection of the compound. In general, labels fall 
into three classes: a) isotopic labels, which may be radioactive or heavy isotopes; b) immune 
5 labels, which may be antibodies or antigens; and c) colored or fluorescent dyes. The labels 
may be incorporated into the compound at any position. 

In a preferred embodiment, the HDAC8 protein is purified or isolated after expression. HDAC8 
proteins may be isolated or purified in a variety of ways known to those skilled in the art 
depending on what other components are present in the sample. Standard purification methods 

1 0 include electrophoretic, molecular, immunological and chromatographic techniques, including 
ion exchange, hydrophobic, affinity, and reverse-phase HPLC chromatography, and 
chromatofocusing. For example, the HDAC8 protein may be purified using a standard anti- 
HDAC8 antibody column. Ultrafiltration and diafiltration techniques, in conjunction with protein 
concentration, are also useful. For general guidance in suitable purification techniques, see 

1 5 Scopes, R., Protein Purification, Springer-Verlag, NY (1982). The degree of purification 
necessary will vary depending on the use of the HDAC8 protein. In some instances no 
purification will be necessary. 

Once expressed and purified, if necessary, the HDAC8 proteins and nucleic acids are useful in 
a number of applications. 

20 The nucleotide sequences (or their complement) encoding HDAC8 proteins have various 
applications in the art of molecular biology, including uses as hybridization probes, in 
chromosome and gene mapping and in the generation of anti-sense RNA and DNA. HDAC8 
protein nucleic acid will also be useful for the preparation of HDAC8 proteins by the 
recombinant techniques described herein. 

25 The full-length native sequence HDAC8 protein gene, or portions thereof, may be used as 
hybridization probes for a cDNA library to isolate other genes (for instance, those encoding 
naturally-occurring variants of HDAC8 protein or HDAC8 from other species) which have a 
desired sequence identity to the HDAC8 protein coding sequence. Optionally, the length of the 
probes will be about 20 to about 50 bases. The hybridization probes may be derived from the 

30 nucleotide sequences herein or from genomic sequences including promoters, enhancer 

elements and introns of native sequences as provided herein. By way of example, a screening 
method will comprise isolating the coding region of the HDAC8 protein gene using the known 
DNA sequence to synthesize a selected probe of about 40 bases. Hybridization probes may be 
labeled by a variety of labels, including radionucleotides such as ^P or ^S, or enzymatic labels 

35 such as alkaline phosphatase coupled to the probe via avidin/biotin coupling systems. Labeled 
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probes having a sequence complementary to that of the HDAC8 protein gene of the present 
invention can be used to screen libraries of human cDNA, genomic DNA or mRNA to determine 
to which members of such libraries the probe hybridizes. 

Nucleotide sequences encoding a HDAC8 protein can also be used to construct hybridization 
5 probes for mapping the gene which encodes that HDAC8 protein and for the genetic analysis of 
individuals with genetic disorders. The nucleotide sequences provided herein may be mapped 
to a chromosome and specific regions of a chromosome using known techniques, such as in 
situ hybridization, linkage analysis against known chromosomal markers, and hybridization 
screening with libraries. 

1 0 Nucleic acids which encode HDAC8 protein or its modified forms can also be used to generate 
either transgenic animals or "knock out" animals which, in turn, are useful in the development 
and screening of therapeutically useful reagents. A transgenic animal (e.g., a mouse or rat) is 
an animal having cells that contain a transgene, which transgene was introduced into the animal 
or an ancestor of the animal at a prenatal, e.g., an embryonic stage. A transgene is a DNA 

1 5 which is integrated into the genome of a cell from which a transgenic animal develops. In one 
embodiment, cDNA encoding an HDAC8 protein can be used to clone genomic DNA encoding 
a HDAC8 protein in accordance with established techniques and the genomic sequences used 
to generate transgenic animals that contain cells which express the desired DNA. Methods for 
generating transgenic animals, particularly animals such as mice or rats, have become 

20 conventional in the art and are described, for example, in U.S. Patent Nos. 4,736,866 and 
4,870,009. Typically, particular cells would be targeted for the HDAC8 protein transgene 
incorporation with tissue-specific enhancers. Transgenic animals that include a copy of a 
transgene encoding a HDAC8 protein introduced into the germ line of the animal at an 
embryonic stage can be used to examine the effect of increased expression of the desired 

25 nucleic acid. Such animals can be used as tester animals for reagents thought to confer 
protection from, for example, pathological conditions associated with its overexpression. In 
accordance with this facet of the invention, an animal is treated with the reagent and a reduced 
incidence of the pathological condition, compared to untreated animals bearing the transgene, 
would indicate a potential therapeutic intervention for the pathological condition. 

30 Alternatively, non-human homoiogues of the HDAC8 protein can be used to construct an 
HDAC8 protein "knock our animal which has a defective or altered gene encoding an HDAC8 
protein as a result of homologous recombination between the endogenous gene encoding an 
HDAC8 protein and altered genomic DNA encoding an HDAC8 protein introduced into an 
embryonic cell of the animal. For example, cDNA encoding a HDAC8 protein can be used to 

35 clone genomic DNA encoding a HDAC8 protein in accordance with established techniques. A 
portion of the genomic DNA encoding a HDAC8 protein can be deleted or replaced with another 
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gene, such as a gene encoding a selectable marker which can be used to monitor integration. 
Typically, several kilobases of unaltered flanking DNA (both at the 5* and 3' ends) are included 
in the vector [see e.g., Thomas and Capecchi, Cell . 51:503 (1987) for a description of 
homologous recombination vectors]. The vector is introduced into an embryonic stem cell line 
5 (e.g., by electroporation) and cells in which the introduced DNA has homologously recombined 
with the endogenous DNA are selected [see e.g., Li et al. f CeH, 69:91 5 (1992)]. The selected 
cells are then injected into a blastocyst of an animal (e.g., a mouse or rat) to form aggregation 
chimeras [see e.g., Bradley, in Teratocarcinomas and Embryonic Stem Cells: A Practical 
Approach, E. J. Robertson, ed. (IRL, Oxford, 1987), pp. 1 13-152]. A chimeric embryo can then 

1 0 be implanted into a suitable pseudopregnant female foster animal and the embryo brought to 
term to create a "knock out" animal. Progeny harboring the homologously recombined DNA in 
their germ cells can be identified by standard techniques and used to breed animals in which all 
cells of the animal contain the homologously recombined DNA. Knockout animals can be 
characterized for instance, for their ability to defend against certain pathological conditions and 

1 5 for their development of pathological conditions due to absence of the HDAC8 protein. 

It is understood that the models described herein can be varied. For example, "knock-in" 
models can be formed, or the models can be cell-based rather than animal models. 

Nucleic acid encoding the HDAC8 polypeptides, antagonists or agonists thereof may also be 
used in gene therapy. In gene therapy applications, genes are introduced into cells in order to 

20 achieve in vivo synthesis of a therapeutically effective genetic product, for example for 
replacement of a defective gene. "Gene therapy" includes both conventional gene therapy 
where a lasting effect is achieved by a single treatment, and the administration of gene 
therapeutic agents, which involves the one time or repeated administration of a therapeutically 
effective DNA or mRNA. Antisense RNAs and DNAs can be used as therapeutic agents for 

25 blocking the expression of certain genes in vivo. It has already been shown that short 

antisense oligonucleotides can be imported into cells where they act as inhibitors, despite their 
low intracellular concentrations caused by their restricted uptake by the cell membrane. 
(Zamecnik et al., Proc. Natl. Acad. Sci. USA 83, 4143-4146 [1986]). The oligonucleotides can 
be modified to enhance their uptake, e.g. by substituting their negatively charged 

30 phosphodiester groups by uncharged groups. 

There are a variety of techniques available for introducing nucleic acids into viable cells. The 
techniques vary depending upon whether the nucleic acid is transferred into cultured cells in 
vitro, or in vivo in the cells of the intended host. Techniques suitable for the transfer of nucleic 
acid into mammalian cells in vitro include the use of liposomes, electroporation, microinjection, 
35 cell fusion, DEAE-dextran, the calcium phosphate precipitation method, etc. The currently 
preferred in vivo gene transfer techniques include transfection with viral (typically retroviral) 
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vectors and viral coat protein-liposome mediated transfection (Dzau et a/., Trends in 
Biotechnology 11, 205-210 [1993]). In some situations it is desirable to provide the nucleic acid 
source with an agent that targets the target cells, such as an antibody specific for a cell surface 
membrane protein or the target cell, a ligand for a receptor on the target cell, etc. Where 
5 liposomes are employed, proteins which bind to a cell surface membrane protein associated 
with endocytosis may be used for targeting and/or to facilitate uptake, e.g. capsid proteins or 
fragments thereof tropic for a particular cell type, antibodies for proteins which undergo 
internalization in cycling, proteins that target intracellular localization and enhance intracellular 
half-life. The technique of receptor-mediated endocytosis is described, for example, by Wu et 
10 at. , J. Biol. Chem. 262 . 4429-4432 (1 987); and Wagner et al. , Proc. Natl. Acad. Sci. USA 87, 
3410-3414 (1990). For review of gene marking and gene therapy protocols see Anderson et 
aL Science 256 . 808-813 (1992). 

In a preferred embodiment, the HDAC8 proteins, nucleic acids, variants, modified proteins, cells 
and/or transgenics containing the said nucleic acids or proteins are used in screening assays. 
1 5 Identification of the HDAC8 protein provided herein permits the design of drug screening assays 
for compounds that bind or interfere with the binding to the HDAC8 protein and for compounds 
which modulate HDAC8 activity. 

The assays described herein preferably utilize the human HDAC8 protein, although other 
mammalian proteins may also be used, including rodents (mice, rats, hamsters, guinea pigs, 
20 etc.), farm animals (cows, sheep, pigs, horses, etc.) and primates. These latter embodiments 
may be preferred in the development of animal models of human disease. In some 
embodiments, as outlined herein, variant or derivative HDAC8 proteins may be used, including 
deletion HDAC8 proteins as outlined above. 

In a preferred embodiment, the methods comprise combining an HDAC8 protein and a 
25 candidate bioactive agent, and determining the binding of the candidate agent to the HDAC8 
protein. In other embodiments, further discussed below, binding interference or bioactivity is 
determined. 

The term "candidate bioactive agent" or "exogeneous compound" as used herein describes any 
molecule, e.g., protein, small organic molecule, carbohydrates (including polysaccharides), 
30 polynucleotide, lipids, etc. Generally a plurality of assay mixtures are run in parallel with 
different agent concentrations to obtain a differential response to the various concentrations. 
Typically, one of these concentrations serves as a negative control, /.e., at zero concentration 
or below the level of detection. In addition, positive controls, i.e. the use of agents known to 
alter or modulate HDAC activity, may be used. 
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Candidate agents encompass numerous chemical classes, though typically they are organic 
molecules, preferably small organic compounds having a molecular weight of more than about 
100 daltons and less than about 2,500 daltons. Candidate agents comprise functional groups 
necessary for structural interaction with proteins, particularly hydrogen bonding, and typically 
5 include at least an amine, carbonyl, hydroxyl or carboxyl group, preferably at least two of the 
functional chemical groups. The candidate agents often comprise cyclical carbon or 
heterocyclic structures and/or aromatic or polyaromatic structures substituted with one or more 
of the above functional groups. Candidate agents are also found among biomolecules including 
peptides, saccharides, fatty acids, steroids, purines, pyrimidines, derivatives, structural analogs 
10 or combinations thereof. Particularly preferred are peptides. 

Candidate agents are obtained from a wide variety of sources including libraries of synthetic or 
natural compounds. For example, numerous means are available for random and directed 
synthesis of a wide variety of organic compounds and biomolecules, including expression of 
randomized oligonucleotides. Alternatively, libraries of natural compounds in the form of 
1 5 bacterial, fungal, plant and animal extracts are available or readily produced. Additionally, 
natural or synthetically produced libraries and compounds are readily modified through 
conventional chemical, physical and biochemical means. Known pharmacological agents may 
be subjected to directed or random chemical modifications, such as acylation, alkylation, 
esterification, amidification to produce structural analogs. 

20 In a preferred embodiment, a library of different candidate bioactive agents are used. 

Preferably, the library should provide a sufficiently structurally diverse population of randomized 
agents to effect a probabilistically sufficient range of diversity to allow binding to a particular 
target. Accordingly, an interaction library should be large enough so that at least one of its 
members will have a structure that gives it affinity for the target. Although it is difficult to gauge 

25 the required absolute size of an interaction library, nature provides a hint with the immune 
response: a diversity of 10 7 -10 8 different antibodies provides at least one combination with 
sufficient affinity to interact with most potential antigens faced by an organism. Published in 
vitro selection techniques have also shown that a library size of 10 7 to 10 8 is sufficient to find 
structures with affinity for the target. A library of all combinations of a peptide 7 to 20 amino 

30 acids in length, such as generally proposed herein, has the potential to code for 20 7 (10 9 ) to 20 20 
. Thus, with libraries of 10 7 to 10 8 different molecules the present methods allow a "working" 
subset of a theoretically complete interaction library for 7 amino acids, and a subset of shapes 
for the 20 20 library. Thus, in a preferred embodiment, at least 10 6 , preferably at least 10 7 , more 
preferably at least 10 8 and most preferably at least 10 9 different sequences are simultaneously 

35 analyzed in the subject methods. Preferred methods maximize library size and diversity. 

In a preferred embodiment, the candidate bioactive agents are proteins. By "protein" herein is 
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meant at least two covalently attached amino acids, which includes proteins, polypeptides, 
oligopeptides and peptides. The protein may be made up of naturally occurring amino acids 
and peptide bonds, or synthetic peptidomimetic structures. Thus "amino acid", or "peptide 
residue", as used herein means both naturally occurring and synthetic amino acids. For 
5 example, homo-phenylalanine, citrulline and noreleucine are considered amino acids for the 
purposes of the invention. "Amino acid" also includes imino acid residues such as proline and 
hydroxyproline. The side chains may be in either the (R) or the (S) configuration. In the 
preferred embodiment, the amino acids are in the (S) or L-configuration. If non-naturally 
occurring side chains are used, non-amino acid substituents may be used, for example to 
1 0 prevent or retard in vivo degradations. Chemical blocking groups or other chemical substituents 
may also be added. 

In a preferred embodiment, the candidate bioactive agents are naturally occurring proteins or 
fragments of naturally occurring proteins. Thus, for example, cellular extracts containing 
proteins, or random or directed digests of proteinaceous cellular extracts, may be used. In this 
1 5 way libraries of procaryotic and eukaryotic proteins may be made for screening in the systems 
described herein. Particularly preferred in this embodiment are libraries of bacterial, fungal, 
viral, and mammalian proteins, with the latter being preferred, and human proteins being 
especially preferred. 

In a preferred embodiment, the candidate bioactive agents are peptides of from about 5 to 
20 about 30 amino acids, with from about 5 to about 20 amino acids being preferred, and from 
about 7 to about 15 being particularly preferred. The peptides may be digests of naturally 
occurring proteins as is outlined above, random peptides, or "biased" random peptides. By 
"randomized" or grammatical equivalents herein is meant that each nucleic acid and peptide 
consists of essentially random nucleotides and amino acids, respectively. Since generally 
25 these random peptides (or nucleic acids, discussed below) are chemically synthesized, they 
may incorporate any nucleotide or amino acid at any position. The synthetic process can be 
designed to generate randomized proteins or nucleic acids, to allow the formation of ail or most 
of the possible combinations over the length of the sequence, thus forming a library of 
randomized candidate bioactive proteinaceous agents. 

30 In one embodiment, the library is fully randomized, with no sequence preferences or constants 
at any position, in a preferred embodiment, the library is biased. That is, some positions within 
the sequence are either held constant, or are selected from a limited number of possibilities. 
For example, in a preferred embodiment, the nucleotides or amino acid residues are 
randomized within a defined class, for example, of hydrophobic amino acids, hydrophilic 

35 residues, sterically biased (either small or large) residues, towards the creation of cysteines, for 
cross-linking, prolines for SH-3 domains, serines, threonines, tyrosines or histidines for 
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phosphorylation sites, etc., or to purines, etc. 

In a preferred embodiment, the candidate bioactive agents are nucleic acids. By "nucleic acid" 
or "oligonucleotide" or grammatical equivalents herein means at least two nucleotides 
covalently linked together. A nucleic acid of the present invention will generally contain 
5 phosphodiester bonds, although in some cases, as outlined below, nucleic acid analogs are 
included that may have alternate backbones, comprising, for example, phosphoramide 
(Beaucage, et ai t Tetrahedron, 49(1 0):1 925 (1993) and references therein; Letsinger, J. Org. 
Chem. . 35:3800 (1970); Sprinzl, et a/. ( Eur. J. Biochem. . 81:579 (1977); Letsinger, et at., Nucl. 
Acids Res. . 14:3487 (1986); Sawai, et a/., Chem. Lett. . 805 (1984), Letsinger, et a/ M J.Am. 

10 Chem. Soc . 110:4470 (1988); and Pauwels, et a/., Chemica Scripta , 26:141 (1986)), 
phosphorothioate (Mag, et a/., Nucleic Acids Res. . 19:1437 (1991); and U.S. Patent No. 
5,644,048), phosphorodithioate (Briu, et a/., J. Am. Chem. Soc . 111:2321 (1989)), O- 
methylphophoroamidite linkages (see Eckstein, Oligonucleotides and Analogues: A Practical 
Approach, Oxford University Press), and peptide nucleic acid backbones and linkages (see 

15 Eghoim, J. Am. Chem. Soc . 114:1895 (1992); Meier, ef a/., Chem. Int. Ed. Enal. . 31:1008 
(1992); Nielsen, Nature . 365:566 (1993); Carisson, ef a/., Nature . 380:207 (1996), all of which 
are incorporated by reference)). Other analog nucleic acids include those with positive 
backbones (Denpcy, et aL, Proc. Natl. Acad. Sci. USA . 92:6097 (1995)); non-ionic backbones 
(U.S. Patent Nos. 5,386,023; 5,637,684; 5,602,240; 5,216,141; and 4,469,863; Kiedrowshi, et 

20 a/., Angew. Chem. Intl. Ed. English . 30:423 (1 991 ); Letsinger, et a/., J. Am. Chem. Soc . 

110:4470 (1988); Letsinger, e/aA, Nucleoside & Nucleotide, 13:1597 (1994); Chapters 2 and 3, 
ASC Symposium Series 580, "Carbohydrate Modifications in Antisense Research", Ed. Y.S. 
Sanghui and P. Dan Cook; Mesmaeker, et a/., Biooroanic & Medicinal Chem. Lett.. 4:395 
(1994); Jeffs, et aL, J. Biomolecular NMR . 34:17 (1994); Tetrahedron Lett., 37:743 (1996)) and 

25 non-ribose backbones, including those described in U.S. Patent Nos. 5,235,033 and 5,034,506, 
and Chapters 6 and 7, ASC Symposium Series 580, "Carbohydrate Modifications in Antisense 
Research", Ed. Y.S. Sanghui and P. Dan Cook. Nucleic acids containing one or more 
carbocyclic sugars are also included within the definition of nucleic acids (see Jenkins, et a/., 
Chem. Soc Rev.. (1995) pp. 169-176). Several nucleic acid analogs are described in Rawls, C 

30 & E News, June 2, 1997, page 35. All of these references are hereby expressly incorporated by 
reference. These modifications of the ribose-phosphate backbone may be done to facilitate the 
addition of additional moieties such as labels, or to increase the stability and half-life of such 
molecules in physiological environments. In addition, mixtures of naturally occurring nucleic 
acids and analogs can be made. Alternatively, mixtures of different nucleic acid analogs, and 

35 mixtures of naturally occurring nucleic acids and analogs may be made. The nucleic acids may 
be single stranded or double stranded, as specified, or contain portions of both double stranded 
or single stranded sequence. The nucleic acid may be DNA, both genomic and cDNA, RNA or 
a hybrid, where the nucleic acid contains any combination of deoxyribo- and ribo-nucleotides, 
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and any combination of bases, including uracil, adenine, thymine, cytosine, guanine, inosine, 
xathanine hypoxathanine, isocytosine, isoguanine, etc. 

As described above generally for proteins, nucleic acid candidate bioactive agents may be 
naturally occurring nucleic acids, random nucleic acids, or "biased" random nucleic acids. For 
5 example, digests of procaryotic or eukaryotic genomes may be used as is outlined above for 
proteins. 

In a preferred embodiment, the candidate bioactive agents are organic chemical moieties, a 
wide variety of which are available in the literature. 

In a preferred embodiment, the candidate bioactive agents are linked to a fusion partner. By 
1 0 "fusion partner" or "functional group" herein is meant a sequence that is associated with the 
candidate bioactive agent, that confers upon all members of the library in that class a common 
function or ability. Fusion partners can be heterologous (i.e. not native to the host cell), or 
synthetic (not native to any cell). Suitable fusion partners include, but are not limited to: a) 
presentation structures, which provide the candidate bioactive agents in a conformationally 
1 5 restricted or stable form; b) targeting sequences, which allow the localization of the candidate 
bioactive agent into a subcellular or extracellular compartment; c) rescue sequences which 
allow the purification or isolation of either the candidate bioactive agents or the nucleic acids 
encoding them; d) stability sequences, which confer stability or protection from degradation to 
the candidate bioactive agent or the nucleic acid encoding it, for example resistance to 
20 proteolytic degradation; e) dimerization sequences, to allow for peptide dimerization; or f) any 
combination of a), b), c), d), and e), as well as linker sequences as needed. 

In one embodiment of the methods described herein, portions of HDAC8 proteins are utilized; in 
a preferred embodiment, portions having HDAC8 activity are used to identify agents that bind to 
HDAC8. In addition, the assays described herein may utilize either isolated HDAC8 proteins or 
25 cells comprising the HDAC8 proteins. 

Generally, in a preferred embodiment of the methods herein, for example for binding assays, 
the HDAC8 protein or the candidate agent is non-diffusibly bound to an insoluble support having 
isolated sample receiving areas (e.g. a microtiter plate, an array, etc.). The insoluble supports 
may be made of any composition to which the compositions can be bound, is readily separated 
30 from soluble material, and is otherwise compatible with the overall method of screening. The 
surface of such supports may be solid or porous and of any convenient shape. Examples of 
suitable insoluble supports include microtiter plates, arrays, membranes and beads. These are 
typically made of glass, plastic (e.g., polystyrene), polysaccharides, nylon or nitrocellulose, 
teflon™, etc. Microtiter plates and arrays are especially convenient because a large number of 
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assays can be carried out simultaneously, using small amounts of reagents and samples. In 
some cases magnetic beads and the like are included. The particular manner of binding of the 
composition is not crucial so long as it is compatible with the reagents and overall methods of 
the invention, maintains the activity of the composition and is nondiffusable. Preferred methods 
5 of binding include the use of antibodies (which do not sterically block either the ligand binding 
site or activation sequence when the protein is bound to the support), direct binding to "sticky" 
or ionic supports, chemical crosslinking, the synthesis of the protein or agent on the surface, 
etc. Following binding of the protein or agent, excess unbound material is removed by washing. 
The sample receiving areas may then be blocked through incubation with bovine serum albumin 
1 0 (BSA), casein or other innocuous protein or other moiety. Also included in this invention are 
screening assays wherein solid supports are not used; examples of such are described below. 

In a preferred embodiment, the HDAC8 protein is bound to the support, and a candidate 
bioactive agent is added to the assay. Alternatively, the candidate agent is bound to the 
support and the HDAC8 protein is added. Novel binding agents include specific antibodies, 

1 5 non-natural binding agents identified in screens of chemical libraries, peptide analogs, etc. Of 
particular interest are screening assays for agents that have a low toxicity for human cells. A 
wide variety of assays may be used for this purpose, including labeled in vitro protein-protein 
binding assays, electrophoretic mobility shift assays, immunoassays for protein binding, 
functional assays, preferably deacetylation of short acetylated peptides or labeled acetylated 

20 peptides. 

The determination of the binding of the candidate bioactive agent to the HDAC8 protein may be 
done in a number of ways. In a preferred embodiment, the candidate bioactive agent is labelled, 
and binding determined directly. For example, this may be done by attaching all or a portion of 
the HDAC8 protein to a solid support, adding a labelled candidate agent (for example a radio or 
25 fluorescent label), washing off excess reagent, and determining whether the label is present on 
the solid support. Various blocking and washing steps may be utilized as is known in the art. 

By "labeled" herein is meant that the compound is either directly or indirectly labeled with a label 
which provides a detectable signal, e.g. radioisotope, fluoresces, enzyme, antibodies, particles 
such as magnetic particles, chemiluminescers, or specific binding molecules, etc. Specific 
30 binding molecules include pairs, such as biotin and streptavidin, digoxin and antidigoxin etc. For 
the specific binding members, the complementary member would normally be labeled with a 
molecule which provides for detection, in accordance with known procedures, as outlined 
above. The label can directly or indirectly provide a detectable signal. 

In some embodiments, only one of the components is labeled. For example, the proteins (or 
35 proteinaceous candidate agents) may be labeled at tyrosine positions using 125 l, or with 
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fluorophores. Alternatively, more than one component may be labeled with different labels; 
using 125 l for the proteins, for example, and a fluorophor for the candidate agents. 

In a preferred embodiment, the binding of the candidate bioactive agent is determined through 
4 the use of competitive binding assays. In this embodiment, the competitor is a binding moiety 
5 known to bind to the target molecule (i.e. HDAC8 protein), such as an antibody, peptide, binding 
partner, ligand, etc. Under certain circumstances, there may be competitive binding as between 
the bioactive agent and the binding moiety, with the binding moiety displacing the bioactive 
agent. This assay can be used to determine candidate agents which interfere with binding 
between HDAC8 proteins and binding partners. "Interference of binding" as used herein means 
1 0 that native binding of the HDAC8 protein differs in the presence of the candidate agent. The 
binding can be eliminated or can be with a reduced affinity. Therefore, in one embodiment, 
interference is caused by, for example, a conformation change, rather than direct competition 
for the native binding site. 

In one embodiment, the candidate bioactive agent is labeled. Either the candidate bioactive 
1 5 agent, or the competitor, or both, is added first to the protein for a time sufficient to allow 

binding, if present. Incubations may be performed at any temperature which facilitates optimal 
activity, typically between 4 and 40°C. Incubation periods are selected for optimum activity, but 
may also be optimized to facilitate rapid high through put screening. Typically between about 
0.1 and about 1 .0 hour will be sufficient. Excess reagent is generally removed or washed away. 
20 The second component is then added, and the presence or absence of the labeled component 
is followed, to indicate binding. 

In a preferred embodiment, the competitor is added first, followed by the candidate bioactive 
agent. Displacement of the competitor is an indication that the candidate bioactive agent is 
binding to the HDAC8 protein and thus is capable of binding to, and potentially modulating, the 
25 activity of the HDAC8 protein. In this embodiment, either component can be labeled. Thus, for 
example, if the competitor is labeled, the presence of label in the wash solution indicates 
displacement by the agent. Alternatively, if the candidate bioactive agent is labeled, the 
presence of the label on the support indicates displacement. 

In an alternative embodiment, the candidate bioactive agent is added first, with incubation and 
30 washing, followed by the competitor. The absence of binding by the competitor may indicate 
that the bioactive agent is bound to the HDAC8 protein with a higher affinity. Thus, if the 
candidate bioactive agent is labeled, the presence of the label on the support, coupled with a 
lack of competitor binding, may indicate that the candidate agent is capable of binding to the 
HDAC8 protein. 



29 



WO 01/42437 



PCT/US00/33622 



In a preferred embodiment, the methods comprise differential screening to identity bioactive 
agents that are capable of modulating the activity of the HDAC8 proteins. Such assays can be 
done with the HDAC8 protein or cells comprising said HDAC8 protein. In one embodiment, the 
methods comprise combining an HDAC8 protein and a competitor in a first sample. A second 
5 sample comprises a candidate bioactive agent, an HDAC8 protein and a competitor. The 
binding of the competitor is determined for both samples, and a change, or difference in binding 
between the two samples indicates the presence of an agent capable of binding to the HDAC8 
protein and potentially modulating its activity. That is, if the binding of the competitor is different 
in the second sample relative to the first sample, the agent is capable of binding to the HDAC8 
10 protein. 

Alternatively, a preferred embodiment utilizes differential screening to identify drug candidates 
that bind to the native HDAC8 protein, but cannot bind to modified HDAC8 proteins. The 
structure of the HDAC8 protein may be modeled, and used in rational drug design to synthesize 
agents that interact with that site. Drug candidates that affect HDAC8 bioactivity are also 
1 5 identified by screening drugs for the ability to either enhance or reduce the activity of the 
protein. 

Positive controls and negative controls may be used in the assays. Preferably ail control and 
test samples are performed in at least triplicate to obtain statistically significant results. 
Incubation of all samples is for a time sufficient for the binding of the agent to the protein. 
20 Following incubation, all samples are washed free of non-specifically bound material and the 
amount of bound, generally labeled agent determined. For example, where a radiolabel is 
employed, the samples may be counted In a scintillation counter to determine the amount of 
bound compound. 

A variety of other reagents may be included in the screening assays. These include reagents 
25 like salts, neutral proteins, e.g. albumin, detergents, etc which may be used to facilitate optimal 
protein-protein binding and/or reduce non-specific or background interactions. Also reagents 
that otherwise improve the efficiency of the assay, such as protease inhibitors, nuclease 
inhibitors, anti-microbial agents, etc., may be used. The mixture of components may be added 
. in any order that provides for the requisite binding. 

30 Screening for agents that modulate an activity of an HDAC8 protein may also be done. In a 
preferred embodiment, methods of screening for a bioactive agent capable of modulating the 
activity of HDAC8 comprise the steps of adding a candidate bioactive agent to a sample of a 
HDAC8 protein (or cells comprising a HDAC8) and determining an alteration in the biological 
activity of the HDAC8 protein. "Modulating the activity of a HDAC8 protein" includes an 

35 increase in activity, a decrease in activity, or a change in the type or kind of activity present. 
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Thus, in this embodiment, the candidate agent should both bind to HDAC8 protein (although 
this may not be necessary), and alter its biological or biochemical activity as defined herein. 
The methods include both in vitro screening methods, as are generally outlined above, and in 
vivo screening of cells for alterations in the presence, distribution, activity or amount of HDAC8 
5 protein. 

Thus, in this embodiment, the methods comprise combining an HDAC8 sample and a candidate 
bioactive agent, and evaluating the effect on the HDAC8 activity. By "HDAC8 activity" or 
grammatical equivalents herein is meant one of the HDAC8 protein's biological activities, 
including, but not limited to, its ability to affect the chromatin structure, histone acetylation, 
1 0 transcription, the cell cycle, cell replication, cell viability etc., in vitro or in vivo. 

In a preferred embodiment, the activity of the HDAC8 protein is decreased; in another preferred 
embodiment, the activity of the HDAC8 protein is increased. Thus, bioactive agents that are 
antagonists are preferred in some embodiments, and bioactive agents that are agonists may be 
preferred in other embodiments. 

15 In a preferred embodiment, the invention provides methods of screening for bioactive agents 
capable of modulating the activity of an HDAC8 protein. The methods comprise adding a 
candidate bioactive agent, as defined above, to a cell comprising HDAC8 proteins. Preferred 
cell types include almost any cell. The cells contain a recombinant nucleic acid that encodes an 
HDAC8 protein. In a preferred embodiment, a library of candidate agents are tested on a 

20 plurality of cells. 

Detection of HDAC8 regulation may be done as will be appreciated by those in the art. In one 
embodiment, indicators of the HDAC8 activity are used, for example, deacetylation of 
acetylated synthetic peptides, such as, the peptide RTKR in which the lysine (K) is acetylated. 
There are a number of parameters that may be evaluated or assayed to allow the detection of 

25 alterations in HDAC8 regulation, including, but not limited to, cell viability assays, assays to 
determine whether cells are arrested at a particular cell cycle stage ("cell proliferation assays"), 
and assays to determine at which cell stage the cells have arrested ("cell phase assays"). 
Other parameters include mRNA synthesis, translation, deacetylation of acetylated peptides, as 
described above By assaying or measuring one or more of these parameters, it is possible to 

30 detect not only alterations in HDAC8 regulation, but alterations of different steps of the HDAC8 
regulation pathway. This may be done to evaluate native cells, for example to quantify the 
aggressiveness of a tumor cell type, or to evaluate the effect of candidate drug agents that are 
being tested for their effect on HDAC8 regulation. In this manner, rapid, accurate screening of 
candidate agents may be performed to identify agents that modulate HDAC8 regulation. 
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Thus, for example, the present compositions and methods are useful to elucidate bioactive 
agents that can cause a cell or a population of cells to alter their chromatin structure, histone 
acetylation, transcription activity, or either move out of one growth phase and into another, or 
arrest in a growth phase. In some embodiments, the cells are arrested in a particular growth 
5 phase, and it is desirable to either get them out of that phase or into a new phase. Alternatively, 
it may be desirable to force a cell to arrest in a phase, for example G1 t rather than continue to 
move through the cell cycle. Similarly, it may be desirable in some circumstances to accelerate 
a non-arrested but slowly moving population of cells into either the next phase or just through 
the cell cycle, or to delay the onset of the next phase. For example, it may be possible to alter 
1 0 the activities of certain enzymes, for example kinases, phosphatases, proteases or 
ubiquitination enzymes, that contribute to initiating cell phase changes. 

In a preferred embodiment, the methods outlined herein are done on cells that are not arrested 
in the G1 phase; that is, they are rapidly or uncontrollably growing and replicating, such as 
tumor cells. In this manner, candidate agents are evaluated to find agents that can alter histone 

1 5 acetylation, transcription, and the cell cycle regulation, i.e. cause the cells to arrest at cell cycle 
checkpoints, such as in G1 (although arresting in other phases such as S, G2 or M are also 
desirable). Alternatively, candidate agents are evaluated to find agents that can cause inhibition 
of HDAC8, and cause proliferation of a population of cells, i.e. that allow cells that are generally 
arrested in G1 to start proliferating again; for example, peripheral blood cells, terminally 

20 differentiated cells, stem cells in culture, etc. 

Accordingly, the invention provides methods of screening for alterations in HDAC8 regulation of 
a population of cells. By "alteration" or "modulation" (used herein interchangeably), is generally 
meant one of two things. In a preferred embodiment, the alteration results in a change, for 
example, in histone acetylation, the cell cycle of a cell, i.e. a proliferating cell arrests in any one 

25 of the phases, or an arrested cell moves out of its arrested phase and starts the cell cycle, as 
compared to another cell or in the same cell under different conditions. Alternatively, the 
progress of a cell through any particular phase may be altered; that is, there may be an 
acceleration or delay in the length of time it takes for the cells to move thorough a particular 
growth phase. For example, the cell may be normally undergo a G1 phase of several hours; the 

30 addition of an agent may prolong the G1 phase. 

The measurements can be determined wherein all of the conditions are the same for each 
measurement, or under various conditions, with or without bioactive agents, or at different 
stages of the cell cycle process. For example, a measurement of HDAC8 regulation can be 
determined in a cell or ceil population wherein a candidate bioactive agent is present and 
35 wherein the candidate bioactive agent is absent. In another example, the measurements of 
HDAC8 regulation are determined wherein the condition or environment of the cell or 
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populations of cells differ from one another. For example, the cells may be evaluated in the 
presence or absence or previous or subsequent exposure of physiological signals, for example 
hormones, antibodies, peptides, antigens, cytokines, growth factors, action potentials, 
pharmacological agents including chemotherapeutics, radiation, carcinogenics, or other cells 
5 (i.e. cell-cell contacts). In another example, the measurements of HDAC8 regulation are 
determined at different stages of the cell cycle process. In yet another example, the 
measurements of HDAC8 regulation are taken wherein the conditions are the same, and the 
alterations are between one cell or cell population and another cell or cell population. 

By a "population of cells" or "library of cells" herein is meant at least two cells, with at least 
1 0 about 1 0 3 being preferred, at least about 1 0 6 being particularly preferred, and at least about 1 0 8 
to 10 9 being especially preferred. The population or sample can contain a mixture of different 
cell types from either primary or secondary cultures although samples containing only a single 
cell type are preferred, for example, the sample can be from a cell line, particularly tumor cell 
lines, as outlined below. The cells may be in any cell phase, either synchronously or not, 
1 5 including M, G1 , S, and G2. In a preferred embodiment, cells that are replicating or proliferating 
are used; this may allow the use of retroviral vectors for the introduction of candidate bioactive 
agents. Alternatively, non-replicating cells may be used, and other vectors (such as adenovirus 
and lentivirus vectors) can be used. In addition, although not required, the cells are compatible 
with dyes and antibodies. 

20 Preferred cell types for use in the invention include, but are not limited to, mammalian cells, 
including animal (rodents, including mice, rats, hamsters and gerbils), primates, and human 
cells, particularly Including tumor cells of all types, including breast, skin, lung, cervix, 
colonrectal, leukemia, brain, etc. 

In a preferred embodiment, the methods comprise assaying one or more of several different cell 
25 parameters, including, but not limited to, cell viability, cell proliferation, and cell phase. 

In a preferred embodiment, cell viability is assayed, to ensure that a lack of cellular change is 
due to experimental conditions (i.e. the introduction of a candidate bioactive agent) not cell 
death. There are a variety of suitable cell viability assays which can be used, including, but not 
limited to, light scattering, viability dye staining, and exclusion dye staining. 

30 In a preferred embodiment, a light scattering assay is used as the viability assay, as is well 
known in the art. For example, when viewed in the FACS, cells have particular characteristics 
as measured by their forward and 90 degree (side) light scatter properties. These scatter 
properties represent the size, shape and granule content of the cells. These properties account 
for two parameters to be measured as a readout for the viability. Briefly, the DNA of dying or 
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dead cells generally condenses, which alters the 90° scatter; similarly, membrane blebbing can 
alter the forward scatter. Alterations in the intensity of light scattering, or the cell-refractive 
index indicate alterations in viability. 

Thus, in general, for light scattering assays, a live cell population of a particular cell type is 
5 evaluated to determine it's forward and side scattering properties. This sets a standard for 
scattering that can subsequently be used. 

In a preferred embodiment, the viability assay utilizes a viability dye. There are a number of 
known viability dyes that stain dead or dying cells, but do not stain growing cells. For example, 
annexin V is a member of a protein family which displays specific binding to phospholipid 

1 0 (phosphotidylserine) in a divalent ion dependent manner. This protein has been widely used for 
the measurement of apoptosis (programmed cell death) as cell surface exposure of 
phosphatidylserine is a hallmark early signal of this process. Suitable viability dyes include, but 
are not limited to, annexin, ethidium homodimer-1, DEAD Red, propidium iodide, SYTOX 
Green, etc., and others known in the art; see the Molecular Probes Handbook of Fluorescent 

1 5 Probes and Research Chemicals, Haugland, Sixth Edition, hereby incorporated by reference; 
see Apoptosis Assay on page 285 in particular, and Chapter 16. 

Protocols for viability dye staining for cell viability are known, see Molecular Probes catalog, 
supra, in this embodiment, the viability dye such as annexin is labeled, either directly or 
indirectly, and combined with a cell population. Annexin is commercially available, i.e., from 

20 PharMingen, San Diego, California, or Caltag Laboratories, Millbrae, California. Preferably, the 
viability dye is provided in a solution wherein the dye is in a concentration of about 100 ng/ml to 
about 500 ng/ml, more preferably, about 500 ng/ml to about 1 pg/ml, and most preferably, from 
about 1 (Jg/ml to about 5 Mg/ml. In a preferred embodiment, the viability dye is directly labeled; 
for example, annexin may be labeled with a fluorochrome such as fluorecein isothiocyanate 

25 (FITC), Alexa dyes, TRITC, AMCA, APC, tri-color, Cy-5, and others known in the art or 

commercially available, in an alternate preferred embodiment, the viability dye is labeled with a 
first label, such as a hapten such as biotin, and a secondary fluorescent label is used, such as 
fluorescent streptavidin. Other first and second labeling pairs can be used as will be 
appreciated by those in the art. 

30 Once added, the viability dye is allowed to incubate with the cells for a period of time, and 
washed, if necessary. The cells are then sorted as outlined below to remove the non-viable 
cells. 

In a preferred embodiment, exclusion dye staining is used as the viability assay. Exclusion 
dyes are those which are excluded from living cells, i.e. they are not taken up passively (they do 



34 



WO 01/42437 



PCT/US00/33622 



not permeate the cell membrane of a live cell). However, due to the permeability of dead or 
dying cells, they are taken up by dead cells. Generally, but not always, the exclusion dyes bind 
to DNA, for example via intercalation. Preferably, the exclusion dye does not fluoresce, or 
fluoresces poorly, in the absence of DNA; this eliminates the need for a wash step. 
5 Alternatively, exclusion dyes that require the use of a secondary label may also be used. 

Preferred exclusion dyes include, but are not limited to, ethidium bromide; ethidium homodimer- 
1; propidium iodine; SYTOX green nucleic acid stain; Calcein AM, BCECF AM; fluorescein 
diacetate; TOTO® and TO-PRO™ (from Molecular Probes; supra, see chapter 16) and others 
known in the art. 

1 0 Protocols for exclusion dye staining for cell viability are known, see the Molecular Probes 
catalog, supra. In general, the exclusion dye is added to the cells at a concentration of from 
about 100 ng/mi to about 500 ng/ml, more preferably, about 500 ng/ml to about 1 |Jg/mi T and 
most preferably, from about 0.1 pg/ml to about 5 pg/ml, with about 0.5 pg/ml being particularly 
preferred. The cells and the exclusion dye are incubated for some period of time, washed, if 

1 5 necessary, and then the cells sorted as outlined below, to remove non-viable cells from the 
population. 

In addition, there are other cell viability assays which may be run, including for example 
enzymatic assays, which can measure extracellular enzymatic activity of either live cells (i.e. 
secreted proteases, etc.), or dead cells (i.e. the presence of intracellular enzymes in the media; 
20 for example, intracellular proteases, mitochondrial enzymes, etc.). See the Molecular Probes 
Handbook of Fluorescent Probes and Research Chemicals, Haugland, Sixth Edition, hereby 
incorporated by reference; see chapter 16 in particular. 

in a preferred embodiment, at least one cell viability assay is run, with at least two different cell 
viability assays being preferred, when the fluors are compatible. When only 1 viability assay is 
25 run, a preferred embodiment utilizes light scattering assays (both forward and side scattering). 
When two viability assays are run, preferred embodiments utilize light scattering and dye 
exclusion, with light scattering and viability dye staining also possible, and all three being done 
in some cases as well. Viability assays thus allow the separation of viable cells from non-viable 
or dying cells. 

30 In addition to a cell viability assay, a preferred embodiment utilizes a ceil proliferation assay. By 
"proliferation assay" herein is meant an assay that allows the determination that a cell 
population is either proliferating, i.e. replicating, or not replicating. 

In a preferred embodiment, the proliferation assay is a dye inclusion assay. A dye inclusion 
assay relies on dilution effects to distinguish between cell phases. Briefly, a dye (generally a 
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fluorescent dye as outlined below) is introduced to cells and taken up by the cells. Once taken 
up, the dye is trapped in the cell, and does not diffuse out. As the cell population divides, the 
dye is proportionally diluted. That is, after the introduction of the inclusion dye, the cells are 
allowed to incubate for some period of time; cells that lose fluorescence over time are dividing, 
5 and the cells that remain fluorescent are arrested in a non-growth phase. 

Generally, the introduction of the inclusion dye may be done in one of two ways. Either the dye 
cannot passively enter the cells (e.g. it is charged), and the cells must be treated to take up the 
dye; for example through the use of a electric pulse. Alternatively, the dye can passively enter 
the cells, but once taken up, it is modified such that it cannot diffuse out of the cells. For 
1 0 example, enzymatic modification of the inclusion dye may render it charged, and thus unable to 
diffuse out of the cells. For example, the Molecular Probes CellTracker™ dyes are fluorescent 
chloromethyl derivatives that freely diffuse into cells, and then glutathione S-transferase- 
mediated reaction produces membrane impermeant dyes. 

Suitable inclusion dyes include, but are not limited to, the Molecular Probes line of 
1 5 CellTracker™ dyes , including, but not limited to CellTracker™ Blue, CellTracker™ Yellow- 
Green, CellTracker™ Green, CellTracker™ Orange, PKH26 (Sigma), and others known in the 
art; see the Molecular Probes Handbook, supra; chapter 15 in particular. 

In general, inclusion dyes are provided to the cells at a concentration ranging from about 

1 00 ng/ml to about 5 pg/ml, with from about 500 ng/ml to about 1 pg/ml being preferred. A wash 

20 step may or may not be used. In a preferred embodiment, a candidate bioactive agent is 

combined with the cells as described herein. The cells and the inclusion dye are incubated for 
some period of time, to allow cell division and thus dye dilution. The length of time will depend 
on the cell cycle time for the particular cells; in general, at least about 2 cell divisions are 
preferred, with at least about 3 being particularly preferred and at least about 4 being especially 

25 preferred. The cells are then sorted as outlined below, to create populations of cells that are 
replicating and those that are not. As will be appreciated by those in the art, in some cases, for 
example when screening for anti-proliferation agents, the bright (i.e. fluorescent) cells are 
collected; in other embodiments, for example for screening for proliferation agents, the low 
fluorescence cells are collected. Alterations are determined by measuring the fluorescence at 

30 either different time points or in different cell populations, and comparing the determinations to 
one another or to standards. 

In a preferred embodiment, the proliferation assay is an antimetabolite assay. In general, 
antimetabolite assays find the most use when agents that cause cellular arrest in G1 or G2 
resting phase is desired. In an antimetabolite proliferation assay, the use of a toxic 
35 antimetabolite that will kill dividing cells will result in survival of only those cells that are not 
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dividing. Suitable antimetabolites include, but are not limited to, standard chemotherapeutic 
agents such as methotrexate, cisplatin, taxol, hydroxyurea, nucleotide analogs such as AraC, 
etc. In addition, antimetabolite assays may include the use of genes that cause cell death upon 
expression. 

5 The concentration at which the antimetabolite is added will depend on the toxicity of the 
particular antimetabolite, and will be determined as is known in the art. The antimetabolite is 
added and the cells are generally incubated for some period of time; again, the exact period of 
time will depend on the characteristics and identity of the antimetabolite as well as the cell cycle 
time of the particular cell population. Generally, a time sufficient for at least one cell division to 
10 occur. 

In a preferred embodiment, at least one proliferation assay is run, with more than one being 
preferred. Thus, a proliferation assay results in a population of proliferating cells and a 
population of arrested cells. Moreover, other proliferation assays may be used, i.e., colorimetric 
assays known in the art. 

15 In a preferred embodiment, either after or simultaneously with one or more of the proliferation 
assays outlined above, at least one cell phase assay is done. A "cell phase" assay determines 
at which cell phase the cells are arrested, M, G1 , S, or G2. 

In a preferred embodiment, the cell phase assay is a DNA binding dye assay. Briefly, a DNA 
binding dye is introduced to the cells, and taken up passively. Once inside the cell, the DNA 

20 binding dye binds to DNA, generally by intercalation, although in some cases, the dyes can be 
either major or minor groove binding compounds. The amount of dye is thus directly correlated 
to the amount of DNA in the cell, which varies by cell phase; G2 and M phase cells have twice 
the DNA content of G1 phase cells, and S phase cells have an intermediate amount, depending 
on at what point in S phase the cells are. Suitable DNA binding dyes are permeant, and 

25 include, but are not limited to, Hoechst 33342 and 33258, acridine orange, 7-AAD, LDS 751 , 
DAPI, and SYTO 16, Molecular Probes Handbook, supra; chapters 8 and 16 in particular. 

In general, the DNA binding dyes are added in concentrations ranging from about 1 pg/ml to 
about 5 pg/ml. The dyes are added to the cells and allowed to incubate for some period of time; 
the length of time will depend in part on the dye chosen. In one embodiment, measurements 
30 are taken immediately after addition of the dye. The cells are then sorted as outlined below, to 
create populations of cells that contain different amounts of dye, and thus different amounts of 
DNA; in this way, cells that are replicating are separated from those that are not. As will be 
appreciated by those in the art, in some cases, for example when screening for anti-proliferation 
agents, cells with the least fluorescence (and thus a single copy of the genome) can be 
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separated from those that are replicating and thus contain more than a single genome of DNA. 
Alterations are determined by measuring the fluorescence at either different time points or in 
different cell populations, and comparing the determinations to one another or to standards. 

In a preferred embodiment, the cell phase assay is a cyclin destruction assay. In this 
5 embodiment, prior to screening (and generally prior to the introduction of a candidate bioactive 
agent, as outlined below), a fusion nucleic acid is introduced to the cells. The fusion nucleic 
acid comprises nucleic acid encoding a cyclin destruction box and a nucleic acid encoding a 
detectable molecule. "Cyclin destruction boxes" are known in the art and are sequences that 
cause destruction via the ubiquitination pathway of proteins containing the boxes during 

10 particular cell phases. That is, for example, G1 cyclins may be stable during G1 phase but 
degraded during S phase due to the presence of a G1 cyclin destruction box. Thus, by linking a 
cyclin destruction box to a detectable molecule, for example green fluorescent protein, the 
presence or absence of the detectable molecule can serve to identify the cell phase of the cell 
population. In a preferred embodiment, multiple boxes are used, preferably each with a 

1 5 different fluor, such that detection of the cell phase can occur. 

A number of cyclin destruction boxes are known in the art, for example, cyclin A has a 
destruction box comprising the sequence RTVLGVIGD (SEQ ID NO:7); the destruction box of 
cyclin B1 comprises the sequence RTALGDIGN (SEQ ID NO:8). See Glotzer et al., Nature 
349:132-138 (1991). Other destruction boxes are known as well: 
20 YMTVSIIDRFMQDSCVPKKMLQLVGVT (SEQ ID NO:9; rat cyclin B); 

KFRLLQETMYMTVSIIDRFMQNSCVPKK (SEQ ID NO:10; mouse cyclin B); 
RAILIDWLIQVQMKFRLLQETMYMTVS (SEQ ID NO:11; mouse cyclin B1); 
DRFLQAQLVCRKKLQWGITALLLASK (SEQ ID NO:12; mouse cyclin B2); and 
MSVLRGKLQLVGTAAMLL (SEQ ID NO:13; mouse cyclin A2). 

25 The nucleic acid encoding the cyclin destruction box is operabiy linked to nucleic acid encoding 
a detectable molecule. The fusion proteins are constructed by methods known in the art. For 
example, the nucleic acids encoding the destruction box is ligated to a nucleic acid encoding a 
detectable molecule. By "detectable molecule" herein is meant a molecule that allows a cell or 
compound comprising the detectable molecule to be distinguished from one that does not 

30 contain it, i.e., an epitope, sometimes called an antigen TAG, a specific enzyme, or a 
fluorescent molecule. Preferred fluorescent molecules include but are not limited to green 
fluorescent protein (GFP), blue fluorescent protein (BFP), yellow fluorescent protein (YFP), red 
fluorescent protein (RFP), and enzymes including luciferase and P-galactosidase. When 
antigen TAGs are used, preferred embodiments utilize cell surface antigens. The epitope is 

35 preferably any detectable peptide which is not generally found on the cytoplasmic membrane, 
although in some instances, if the epitope is one normally found on the cells, increases may be 
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detected, although this is generally not preferred. Similarly, enzymatic detectable molecules 
may also be used; for example, an enzyme that generates a novel or chromogenic product. 

Accordingly, the results of sorting after cell phase assays generally result in at least two 
populations of cells that are in different cell phases. 

5 The proteins and nucleic acids provided herein can also be used for screening purposes 
wherein the protein-protein interactions of the HDAC8 proteins can be identified. Genetic 
systems have been described to detect protein-protein interactions. The first work was done in 
yeast systems, namely the "yeast two-hybrid" system. The basic system requires a protein- 
protein interaction in order to turn on transcription of a reporter gene. Subsequent work was 

1 0 done in mammalian cells. See Fields et al. f Nature 340:245 (1989); Vasavada et al. ( PNAS 
USA 88:10686 (1991); Fearon et al. t PNAS USA 89:7958 (1992); Dang et al. a Mol. Cell. Biol. 
11:954 (1991); Chien et aL, PNAS USA 88:9578 (1991); and U.S. Patent Nos. 5,283,173, 
5,667,973, 5,468,614, 5,525,490, and 5,637,463. a preferred system is described in Serial Nos. 
09/050,863, filed March 30, 1998 and 09/359,081 filed July 22, 1999, entitled "Mammalian 

1 5 Protein Interaction Cloning System". For use in conjunction with these systems, a particularly 
useful shuttle vector is described in Serial No. 09/133,944, filed August 14, 1998, entitled 
"Shuttle Vectors". 

In general, two nucleic acids are transformed into a cell, where one is a "bait" such as the gene 
encoding an HDAC8 protein or a portion thereof, and the other encodes a test candidate. Only 

20 if the two expression products bind to one another will an indicator, such as a fluorescent 

protein, be expressed. Expression of the indicator indicates when a test candidate binds to the 
HDAC8 protein and can be identified as an HDAC8 protein. Using the same system and the 
identified HDAC8 proteins the reverse can be performed. Namely, the HDAC8 proteins 
provided herein can be used to identify new baits, or agents which interact with HDAC8 

25 proteins. Additionally, the two-hybrid system can be used wherein a test candidate is added in 
addition to the bait and the HDAC8 protein encoding nucleic acids to determine agents which 
interfere with the bait and the HDAC8 protein interactions. 

In one embodiment, a mammalian two-hybrid system is preferred. Mammalian systems provide 
post-translational modifications of proteins which may contribute significantly to their ability to 

30 interact. In addition, a mammalian two-hybrid system can be used in a wide variety of 

mammalian cell types to mimic the regulation, induction, processing, etc. of specific proteins 
within a particular cell type. For example, proteins involved in a disease state (i.e., cancer, 
apoptosis related disorders) could be tested in the relevant disease cells. Similarly, for testing 
of random proteins, assaying them under the relevant cellular conditions will give the highest 

35 positive results. Furthermore, the mammalian cells can be tested under a variety of 
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experimental conditions that may affect intracellular protein-protein interactions, such as in the 
presence of hormones, drugs, growth factors and cytokines, radiation, chemotherapeutics, 
cellular and chemical stimuli, etc., that may contribute to conditions which can effect protein- 
protein interactions, particularly those involved in cancer. 

5 Assays involving binding such as the two-hybrid system may take into account non-specific 
binding proteins (NSB). 

Expression in various cell types, and assays for HDAC8 activity are described above. The 
activity assays, such as having an effect on, for example, chromatin structure, histone 
acetylation, transcription etc can be performed to confirm the activity of HDAC8 proteins which 
1 0 have already been identified by their sequence identity/similarity or binding to HDAC8 as well as 
to further confirm the activity of lead compounds identified as modulators of HDAC8. 

The components provided herein for the assays provided herein may also be combined to form 
kits. The kits can be based on the use of the protein and/or the nucleic acid encoding the 
HDAC8 proteins. In one embodiment, other components are provided in the kit. Such 
1 5 components include one or more of packaging, instructions, antibodies, and labels. Additional 
assays such as those used in diagnostics are further described below. 

In this way, bioactive agents are identified. Compounds with pharmacological activity are able 
to enhance or interfere with the activity of the HDAC8 protein. The compounds having the 
desired pharmacological activity may be administered in a physiologically acceptable carrier to 
20 a host, as further described below. 

The present discovery relating to the role of HDAC8 proteins in the cell thus provides methods 
of inducing or preventing cell proliferation in cells. In a preferred embodiment, the HDAC8 
proteins, and particularly HDAC8 protein fragments, are useful in the study or treatment of 
conditions which are mediated by the HDAC8 proteins, i.e. to diagnose, treat or prevent HDAC8 
25 associated disorders. Thus, "HDAC8 associated disorders" or "disease state" include 

conditions involving both insufficient or excessive cell proliferation. Examples of insufficient and 
excessive cell proliferation include aging, apoptosis, necrosis, cancer etc with cancer being 
preferred. Cancer includes any mechanism relating.to cancer including but not limited to 
tumorigenesis, tumor metastasis, and angiogenesis. 

30 Thus, in one embodiment, HDAC8 regulation in cells or organisms are provided. In one 
embodiment, the methods comprise administering to a cell or individual in need thereof, a 
HDAC8 protein in a therapeutic amount. Alternatively, an anti-HDAC8 antibody that reduces or 
eliminates the biological activity of the endogeneous HDAC8 protein is administered. In another 
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embodiment, a bioactive agent as identified by the methods provided herein is administered. 
Alternatively, the methods comprise administering to a cell or individual a recombinant nucleic 
acid encoding an HDAC8 protein. As will be appreciated by those in the art, this may be 
accomplished in any number of ways. In a preferred embodiment, the activity of HDAC8 is 
5 increased by increasing the amount of HDAC8 in the cell, for example by overexpressing the 
endogeneous HDAC8 protein or by administering a gene encoding a HDAC8 protein, using 
known gene-therapy techniques, for example. In a preferred embodiment, the gene therapy 
techniques include the incorporation of the exogeneous gene using enhanced homologous 
recombination (EHR), for example as described in PCT/US93/03868, hereby incorporated by 
1 0 reference in its entirety. 

Without being bound by theory, it appears that HDAC8 protein is an important protein in 
regulating transcription and the HDAC8. Accordingly, disorders based on mutant or variant 
HDAC8 genes may be determined. In one embodiment, the invention provides methods of 
identifying cells containing variant HDAC8 genes comprising determining all or part of the 

1 5 sequence of at least one endogeneous HDAC8 genes in a cell. As will be appreciated by those 
in the art, this may be done using any number of sequencing techniques. In a preferred 
embodiment, the invention provides methods of identifying the HDAC8 genotype of an individual 
comprising determining all or part of the sequence of at least one HDAC8 gene of the individual. 
This is generally done in at least one tissue of the individual, and may include the evaluation of 

20 a number of tissues or different samples of the same tissue. The method may include 

comparing the sequence of the sequenced HDAC8 gene to a known HDAC8 gene, i.e. a wild- 
type gene. 

The sequence of all or part of the HDAC8 gene can then be compared to the sequence of a 
known HDAC8 gene to determine if any differences exist. This can be done using any number 

25 of known sequence identity programs, such as Bestfit, etc. In a preferred embodiment, the 
presence of a difference in the sequence between the HDAC8 gene of the patient and the 
known HDAC8 gene is indicative of a disease state or a propensity for a disease state. 
Accordingly, the present invention provides methods of determining the presence or absence of 
a genetic lesion in an HDAC8 nucleic acid characterized by, for example, an alteration affecting 

30 the integrity of a gene; a deletion, an insertion of substitution; a chromosomal rearrangement; 
an aberrant splicing pattern of a mRNA. 

In one embodiment, the invention provides methods of diagnosing a HDAC8 related condition in 
an individual. The methods comprise measuring the activity of HDAC8 in a sample, such as, a 
tissue or body fluid from the individual or patient. The activity of HDAC8 being preferably 
35 measured by assaying for the ability of a sample to deacetylate a known substrate of HDAC8. 
For example, a radioimmunoassay using specific antisera directed against acetylated vs. 



41 



WO 01/42437 



PCT/US00/33622 



nonacetylated peptides. Alternatively, the ability of an HDAC8 sample from the patient could be 
tested for its ability to deacetylate a radioacetyled natural or synthetic substrate in vitro. This 
activity is compared to the activity of HDAC8 from either a unaffected second individual or from 
an unaffected tissue from the first individual. When these activities are different, the first 
5 individual may be at risk for a HDAC8 associated disorder. For example, cancer cells "may" 
upregulate HDAC8 as a means of suppressing transcription of genes involved in terminal 
differentiation, which would force cells into a proliferative state. Inhibiting HDAC8 would 
therefore allow transcription of genes which would induce terminal differentiation of a cancer cell 
into non-disease state. In this way, for example, monitoring of various disease conditions may 
10 be done, by monitoring the levels of the protein or the expression of mRNA therefor. Similarly, 
expression levels may correlate to the prognosis. 

In one aspect, the expression levels of HDAC8 protein genes are determined in different patient 
samples or cells for which either diagnosis or prognosis information is desired. Gene 
expression monitoring is done on genes encoding HDAC8 proteins. In one aspect, the 

1 5 expression levels of HDAC8 protein genes are determined for different cellular states, such as 
normal cells and cells undergoing apoptosis or transformation. By comparing HDAC8 protein 
gene expression levels in ceils in different states, information including both up- and down- 
regulation of HDAC8 protein genes is obtained, which can be used in a number of ways. For 
example, the evaluation of a particular treatment regime may be evaluated: does a 

20 chemotherapeutic drug act to improve the long-term prognosis in a particular patient. Similarly, 
diagnosis may be done or confirmed by comparing patient samples. Furthermore, these gene 
expression levels allow screening of drug candidates with an eye to mimicking or altering a 
particular expression leve!. This may be done by making biochips comprising sets of important 
HDAC8 protein genes, such as those of the present invention, which can then be used in these 

25 screens. These methods can also be done on the protein basis; that is, protein expression 
levels of the HDAC8 proteins can be evaluated for diagnostic purposes or to screen candidate 
agents. In addition, the HDAC8 protein nucleic acid sequences can be administered for gene 
therapy purposes, including the administration of antisense nucleic acids, or the HDAC8 
proteins administered as therapeutic drugs. 

30 HDAC8 protein sequences bound to biochips include both nucleic acid and amino acid 
sequences as defined above. In a preferred embodiment, nucleic acid probes to HDAC8 
protein nucleic acids (both the nucleic acid sequences having the sequences outlined in the 
figures and/or the complements thereof) are made. The nucleic acid probes attached to the 
biochip are designed to be substantially complementary to the HDAC8 protein nucleic acids, i.e. 

35 the target sequence (either the target sequence of the sample or to other probe sequences, for 
example in sandwich assays), such that hybridization of the target sequence and the probes of 
the present invention occurs. As outlined below, this complementarity need not be perfect; 
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there may be any number of base pair mismatches which will interfere with hybridization 
between the target sequence and the single stranded nucleic acids of the present invention. 
However, if the number of mutations is so great that no hybridization can occur under even the 
least stringent of hybridization conditions, the sequence is not a complementary target 
5 sequence. Thus, by "substantially complementary" herein is meant that the probes are 
sufficiently complementary to the target sequences to hybridize under normal reaction 
conditions, particularly high stringency conditions, as outlined herein. 

A "nucleic acid probe" is generally single stranded but can be partially single and partially 
double stranded. The strandedness of the probe is dictated by the structure, composition, and 
1 0 properties of the target sequence. In general, the nucleic acid probes range from about 8 to 
about 100 bases long, with from about 10 to about 80 bases being preferred, and from about 30 
to about 50 bases being particularly preferred. In some embodiments, much longer nucleic 
acids can be used, up to hundreds of bases (e.g., whole genes). 

As will be appreciated by those in the art, nucleic acids can be attached or immobilized to a 
1 5 solid support in a wide variety of ways. By "immobilized" and grammatical equivalents herein is 
meant the association or binding between the nucleic acid probe and the solid support is 
sufficient to be stable under the conditions of binding, washing, analysis, and removal as 
outlined below. The binding can be covalent or non-covalent. By "non-covalent binding" and 
grammatical equivalents herein is meant one or more of either electrostatic, hydrophilic, and 
20 hydrophobic interactions. Included in non-covalent binding is the covalent attachment of a 
molecule, such as, streptavidin to the support and the non-covalent binding of the biotinylated 
probe to the streptavidin. By M covalent binding" and grammatical equivalents herein Is meant 
that the two moieties, the solid support and the probe, are attached by at least one bond, 
including sigma bonds, pi bonds and coordination bonds. Covalent bonds can be formed 
25 directly between the probe and the solid support or can be formed by a cross linker or by 

inclusion of a specific reactive group on either the solid support or the probe or both molecules, 
immobilization may also involve a combination of covalent and non-covalent interactions. 

In general, the probes are attached to the biochip in a wide variety of ways, as will be 
appreciated by those in the art. As described herein, the nucleic acids can either be 
30 synthesized first, with subsequent attachment to the biochip, or can be directly synthesized on 
the biochip. 

The biochip comprises a suitable solid substrate. By "substrate" or "solid support" or other 
grammatical equivalents herein is meant any material that can be modified to contain discrete 
individual sites appropriate for the attachment or association of the nucleic acid probes and is 
35 amenable to at least one detection method. As will be appreciated by those in the art, the 
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number of possible substrates are very large, and include, but are not limited to, glass and 
modified or functionaiized glass, plastics (including acrylics, polystyrene and copolymers of 
styrene and other materials, polypropylene, polyethylene, polybutylene, polyurethanes, TeflonJ, 
etc.). polysaccharides, nylon or nitrocellulose, resins, silica or silica-based materials including 
5 silicon and modified silicon, carbon, metals, inorganic glasses, plastics, etc. In general, the 
substrates allow optical detection and do not appreciably show fluorescence. 

In a preferred embodiment, the surface of the biochip and the probe may be derivatized with 
chemical functional groups for subsequent attachment of the two. Thus, for example, the 
biochip is derivatized with a chemical functional group including, but not limited to, amino 

1 0 groups, carboxy groups, oxo groups and thiol groups, with amino groups being particularly 
preferred. Using these functional groups, the probes can be attached using functional groups 
on the probes. For example, nucleic acids containing amino groups can be attached to 
surfaces comprising amino groups, for example using linkers as are known in the art; for 
example, homo-or hetero-bifunctional linkers as are well known (see 1994 Pierce Chemical 

15 Company catalog, technical section on cross-linkers, pages 155-200, incorporated herein by 
reference). In addition, in some cases, additional linkers, such as alkyl groups (including 
substituted and heteroalkyl groups) may be used. 

In this embodiment, oligonucleotides, corresponding to the nucleic acid probe, are synthesized 
as is known in the art, and then attached to the surface of the solid support. As will be 
20 appreciated by those skilled in the art, either the 5* or 3* terminus may be attached to the solid 
support, or attachment may be via an internal nucleoside. 

In an additional embodiment, the immobilization to the solid support may be very strong, yet 
non-covalent. For example, biotinylated oligonucleotides can be made, which bind to surfaces 
covalently coated with streptavidin, resulting in attachment. 

25 Alternatively, the oligonucleotides may be synthesized on the surface, as is known in the art. 
For example, photoactivation techniques utilizing photopolymerization compounds and 
techniques are used. In a preferred embodiment, the nucleic acids can be synthesized in situ, 
using well known photolithographic techniques, such as those described in WO 95/251 16; WO 
95/35505; U.S. Patent Nos. 5,700,637 and 5,445,934; and references cited within, all of which 

30 are expressly incorporated by reference; these methods of attachment form the basis of the 
Affimetrix GeneChip™ technology. 

"Differential expression," or grammatical equivalents as used herein, refers to both qualitative 
as well as quantitative differences in the genes' temporal and/or cellular expression patterns 
within and among the cells. Thus, a differentially expressed gene can qualitatively have its 
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expression altered, including an activation or inactivation, in, for example, normal versus 
apoptotic cell. That is, genes may be turned on or turned off in a particular state, relative to 
another state. As is apparent to the skilled artisan, any comparison of two or more states can 
be made. Such a qualitatively regulated gene will exhibit an expression pattern within a state or 
5 cell type which is detectable by standard techniques in one such state or cell type, but is not 
detectable in both. Alternatively, the determination is quantitative in that expression is 
increased or decreased; that is, the expression of the gene is either upregulated, resulting in an 
increased amount of transcript, or downregulated, resulting in a decreased amount of transcript. 
The degree to which expression differs need only be large enough to quantify via standard 
1 0 characterization techniques as outlined below, such as by use of Affymetrix GeneChip™ 
expression arrays, Lockhart, Nature Biotechnology 14:1675-1680 (1996), hereby expressly 
incorporated by reference. Other techniques include, but are not limited to, quantitative reverse 
transcriptase PGR, Northern analysis and RNase protection. 

As will be appreciated by those in the art, this may be done by evaluation at either the gene 
1 5 transcript, or the protein level; that is, the amount of gene expression may be monitored using 
nucleic acid probes to the DNA or RNA equivalent of the gene transcript, and the quantification 
of gene expression levels, or, alternatively, the final gene product itself (protein) can be 
monitored, for example through the use of antibodies to the HDAC8 protein and standard 
immunoassays (ELISAs, etc.) or other techniques, including mass spectroscopy assays, 2D gel 
20 electrophoresis assays, etc. 

In another method detection of the mRNA is performed in situ. In this method penmeabilized 
cells or tissue samples are contacted with a detectably labeled nucleic acid probe for sufficient 
time to allow the probe to hybridize with the target mRNA. Following washing to remove the 
non-specifically bound probe, the label is detected. For example a digoxygenin labeled 
25 riboprobe (RNA probe) that is complementary to the mRNA encoding an HDAC8 protein is 
detected by binding the digoxygenin with an anti-digoxygenin secondary antibody and 
developed with nitro blue tetrazolium and 5-bromo-4-chloro-3-indoyl phosphate. 

In another preferred method, expression of HDAC8 protein is performed using in situ imaging 
techniques employing antibodies to HDAC8 proteins. In this method cells are contacted with 

30 from one to many antibodies to the HDAC8 protein(s). Following washing to remove non- 
specific antibody binding, the presence of the antibody or antibodies is detected. In one 
embodiment the antibody is detected by incubating with a secondary antibody that contains a 
detectable label. In another method the primary antibody to the HDAC8 protein(s) contains a 
detectable label. In another preferred embodiment each one of multiple primary antibodies 

35 contains a distinct and detectable label. This method finds particular use in simultaneous 
screening for a plurality of HDAC8 proteins. The label may be detected in a fluorometer which 
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has the ability to detect and distinguish emissions of different wavelengths. In addition, a 
fluorescence activated cell sorter (FACS) can be used in this method. As will be appreciated by 
one of ordinary skill in the art, numerous other histological imaging techniques are useful in the 
invention and the antibodies can be used in ELISA, immunoblotting (Western blotting), 
5 immunoprecipitation, BIACORE technology, and the like. 

In one embodiment, the HDAC8 proteins of the present invention may be used to generate 
polyclonal and monoclonal antibodies to HDAC8 proteins, which are useful as described herein. 
Similarly, the HDAC8 proteins can be coupled, using standard technology, to affinity 
chromatography columns. These columns may then be used to purify HDAC8 antibodies. In a 

1 0 preferred embodiment, the antibodies are generated to epitopes unique to the HDAC8 protein; 
that is, the antibodies show little or no cross-reactivity to other proteins. These antibodies find 
use in a number of applications. For example, the HDAC8 antibodies may be coupled to 
standard affinity chromatography columns and used to purify HDAC8 proteins as further 
described below. The antibodies may also be used as blocking polypeptides, as outlined 

1 5 above, since they will specifically bind to the HDAC8 protein. 

The anti-HDAC8 protein antibodies may comprise polyclonal antibodies. Methods of preparing 
polyclonal antibodies are known to the skilled artisan. Polyclonal antibodies can be raised in a 
mammal, for example, by one or more injections of an immunizing agent and, if desired, an 
adjuvant. Typically, the immunizing agent and/or adjuvant will be injected in the mammal by 

20 multiple subcutaneous or intraperitoneal injections. The immunizing agent may include the 
HDAC8 protein or a fusion protein thereof. It may be useful to conjugate the immunizing agent 
to a protein known to be immunogenic in the mammal being immunized. Examples of such 
immunogenic proteins include but are not limited to keyhole limpet hemocyanin, serum albumin, 
bovine thyroglobulin, and soybean trypsin inhibitor. Examples of adjuvants which may be 

25 employed include Freund's complete adjuvant and MPL-TDM adjuvant (monophosphoryl Lipid 
a, synthetic trehalose dicorynomycolate). The immunization protocol may be selected by one 
skilled in the art without undue experimentation. 

The anti-HDAC8 protein antibodies may, alternatively, be monoclonal antibodies. Monoclonal 
antibodies may be prepared using hybridoma methods, such as those described by Kohier and 
30 Milstein, Nature . 256:495 (1975). In a hybridoma method, a mouse, hamster, or other 

appropriate host animal, is typically immunized with an immunizing agent to elicit lymphocytes 
that produce or are capable of producing antibodies that will specifically bind to the immunizing 
agent. Alternatively, the lymphocytes may be immunized in vitro. 

The immunizing agent will typically include the HDAC8 protein or a fusion protein thereof. 
35 Generally, either peripheral blood lymphocytes ("PBLs") are used if ceils of human origin are 
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desired, or spleen cells or lymph node cells are used if non-human mammalian sources are 
desired. The lymphocytes are then fused with an immortalized cell line using a suitable fusing 
agent, such as polyethylene glycol, to form a hybridoma cell [Goding, Monoclonal Antibodies: 
Principles and Practice . Academic Press, (1986) pp. 59-103]. Immortalized cell lines are 
5 usually transformed mammalian cells, particularly myeloma cells of rodent, bovine and human 
origin. Usually, rat or mouse myeloma cell lines are employed. The hybridoma cells may be 
cultured in a suitable culture medium that preferably contains one or more substances that 
inhibit the growth or survival of the unfused, immortalized cells. For example, if the parental 
cells lack the enzyme hypoxanthine guanine phosphoribosyl transferase (HGPRT or HPRT), 
1 0 the culture medium for the hybridomas typically will include hypoxanthine, aminopterin, and 
thymidine ("HAT medium"), which substances prevent the growth of HGPRT-deficient cells. 

Preferred immortalized cell lines are those that fuse efficiently, support stable high level 
expression of antibody by the selected antibody-producing cells, and are sensitive to a medium 
such as HAT medium. More preferred immortalized cell lines are murine myeloma lines, which 

1 5 can be obtained, for instance, from the Salk Institute Cell Distribution Center, San Diego, 
California and the American Type Culture Collection, Rockville, Maryland. Human myeloma 
and mouse-human heteromyeloma cell lines also have been described for the production of 
human monoclonal antibodies [Kozbor, J. Immunol. , 133:3001 (1984); Brodeur et al. f 
Monoclonal Antibody Production Techniques and Applications . Marcel Dekker, Inc., New York, 

20 (1987) pp. 51-63]. 

The culture medium in which the hybridoma cells are cultured can then be assayed for the 
presence of monoclonal antibodies directed against HDAC8 protein. Preferably, the binding 
specificity of monoclonal antibodies produced by the hybridoma cells is determined by 
immunoprecipitation or by an in vitro binding assay, such as radioimmunoassay (RIA) or 
25 enzyme-linked immunosorbent assay (ELISA). Such techniques and assays are known in the 
art. The binding affinity of the monoclonal antibody can, for example, be determined by the 
Scatchard analysis of Munson and Pollard, Anal. Biochem. . 107:220 (1980). 

After the desired hybridoma cells are identified, the clones may be subcloned by limiting dilution 
procedures and grown by standard methods [Goding, supral . Suitable culture media for this 
30 purpose include, for example, Dulbecco's Modified Eagle's Medium and RPMI-1640 medium. 
Alternatively, the hybridoma cells may be grown in vivo as ascites in a mammal. 

The monoclonal antibodies secreted by the subclones may be isolated or purified from the 
culture medium or ascites fluid by conventional immunoglobulin purification procedures such as, 
for example, protein a-Sepharose, hydroxylapatite chromatography, gel electrophoresis, 
35 dialysis, or affinity chromatography. 
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The monoclonal antibodies may also be made by recombinant DNA methods, such as those 
described in U.S. Patent No. 4,816,567. DNA encoding the monoclonal antibodies of the 
invention can be readily isolated and sequenced using conventional procedures (e.g., by using 
oligonucleotide probes that are capable of binding specifically to genes encoding the heavy and 
5 light chains of murine antibodies). The hybridoma cells of the invention serve as a preferred 
source of such DNA. Once isolated, the DNA may be placed into expression vectors, which are 
then transfected into host cells such as simian COS cells, Chinese hamster ovary (CHO) cells, 
or myeloma cells that do not otherwise produce immunoglobulin protein, to obtain the synthesis 
of monoclonal antibodies in the recombinant host cells. The DNA also may be modified, for 

1 0 example, by substituting the coding sequence for human heavy and light chain constant 

domains in place of the homologous murine sequences [U.S. Patent No. 4,816,567; Morrison et 
al., supral or by covalently joining to the immunoglobulin coding sequence all or part of the 
coding sequence for a non-immunoglobulin polypeptide. Such a non-immunoglobulin 
polypeptide can be substituted for the constant domains of an antibody of the invention, or can 

1 5 be substituted for the variable domains of one antigen-combining site of an antibody of the 
invention to create a chimeric bivalent antibody. 

The antibodies may be monovalent antibodies. Methods for preparing monovalent antibodies 
are well known in the art. For example, one method involves recombinant expression of 
immunoglobulin light chain and modified heavy chain. The heavy chain is truncated generally at 
20 any point in the Fc region so as to prevent heavy chain crosslinking. Alternatively, the relevant 
cysteine residues are substituted with another amino acid residue or are deleted so as to 
prevent crosslinking. 

In vitro methods are also suitable for preparing monovalent antibodies. Digestion of antibodies 
to produce fragments thereof, particularly, Fab fragments, can be accomplished using routine 
25 techniques known in the art. 

The anti-HDAC8 protein antibodies of the invention may further comprise humanized antibodies 
or human antibodies. Humanized forms of non-human (e.g., murine) antibodies are chimeric 
immunoglobulins, immunoglobulin chains or fragments thereof (such as Fv, Fab, Fab', F(ab') 2 or 
other antigen-binding subsequences of antibodies) which contain minimal sequence derived 
30 from non-human immunoglobulin. Humanized antibodies include human immunoglobulins 
(recipient antibody) in which residues from a complementary determining region (CDR) of the 
recipient are replaced by residues from a CDR of a non-human species (donor antibody) such 
as mouse, rat or rabbit having the desired specificity, affinity and capacity. In some instances, 
Fv framework residues of the human immunoglobulin are replaced by corresponding non- 
35 human residues. Humanized antibodies may also comprise residues which are found neither in 
the recipient antibody nor in the imported CDR or framework sequences. In general, the 
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humanized antibody will comprise substantially all of at least one, and typically two, variable 
domains, in which all or substantially all of the CDR regions correspond to those of a non- 
human immunoglobulin and all or substantially all of the FR regions are those of a human 
immunoglobulin consensus sequence. The humanized antibody optimally also will comprise at 
5 least a portion of an immunoglobulin constant region (Fc), typically that of a human 

immunoglobulin [Jones et al. t Nature . 321:522-525 (1986); Riechmann et al. f Nature . 332:323- 
329 (1988); .and Presta, Curr. Op. Struct. Biol. . 2:593-596 (1992)]. 

Methods for humanizing non-human antibodies are well known in the art. Generally, a 
humanized antibody has one or more amino acid residues introduced into it from a source 

1 0 which is non-human. These non-human amino acid residues are often referred to as "import" 
residues, which are typically taken from an "import" variable domain. Humanization can be 
essentially performed following the method of Winter and co-workers [Jones et al., Nature . 
321:522-525 (1986); Riechmann et al., Nature . 332:323-327 (1988); Verhoeyen et al., Science . 
239:1534-1536 (1988)], by substituting rodent CDRs or CDR sequences for the corresponding 

1 5 sequences of a human antibody. Accordingly, such "humanized" antibodies are chimeric 

antibodies (U.S. Patent No. 4,816,567), wherein substantially less than an intact human variable 
domain has been substituted by the corresponding sequence from a non-human species. In 
practice, humanized antibodies are typically human antibodies in which some CDR residues 
and possibly some FR residues are substituted by residues from analogous sites in rodent 

20 antibodies. 

Human antibodies can also be produced using various techniques known in the art, including 
phage display libraries [Hoogenboom and Winter, J. Mol. Biol. . 227:381 (1991); Marks et al., J. 
MoL Biol. . 222:581 (1991)]. The techniques of Cole et al. and Boerner et al. are also available 
for the preparation of human monoclonal antibodies (Cole et al., Monoclonal Antibodies and 

25 Cancer Therapy . Alan R. Liss, p. 77 (1985) and Boerner et al., J. Immunol. . 147(1 ) :86-95 

(1991)]. Similarly, human antibodies can be made by introducing of human immunoglobulin loci 
into transgenic animals, e.g., mice in which the endogenous immunoglobulin genes have been 
partially or completely inactivated. Upon challenge, human antibody production is observed, 
which closely resembles that seen in humans in all respects, including gene rearrangement, 

30 assembly, and antibody repertoire. This approach is described, for example, in U.S. Patent 
Nos. 5,545,807; 5,545,806; 5,569,825; 5,625,126; 5,633,425; 5,661,016, and in the following 
scientific publications: Marks et ai, Bio/Technology 10, 779-783 (1992); Lonberg et al, Nature 
368 856-859 (1994); Morrison, Nature 368 . 812-13 (1994); Fishwild et al., Nature 
Biotechnology 14. 845-51 (1996); Neuberoer. Nature Biotechnology 14. 826 (1996); Lonberg 

35 and Huszar, Intern. Rev. Immunol. 13 65-93 (1 995). 
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Bispecific antibodies are monoclonal, preferably human or humanized, antibodies that have 
binding specificities for at least two different antigens. In the present case, one of the binding 
specificities is for the HDAC8 protein, the other one is for any other antigen, and preferably for a 
cell-surface protein or receptor or receptor subunit. 

5 Methods for making bispecific antibodies are known in the art. Traditionally, the recombinant 
production of bispecific antibodies is based on the co-expression of two immunoglobulin heavy- 
chain/light-chain pairs, where the two heavy chains have different specificities [Milstein and 
Cuello, Nature , 305:537-539 (1983)]. Because of the random assortment of immunoglobulin 
heavy and light chains, these hybridomas (quadromas) produce a potential mixture of ten 
1 0 different antibody molecules, of which only one has the correct bispecific structure. The 
purification of the correct molecule is usually accomplished by affinity chromatography steps. 
Similar procedures are disclosed in WO 93/08829, published 13 May 1993, and in Traunecker 
et ah, EMBO J. . 10:3655-3659 (1991). 

Antibody variable domains with the desired binding specificities (antibody-antigen combining 
1 5 sites) can be fused to immunoglobulin constant domain sequences. The fusion preferably is 

with an immunoglobulin heavy-chain constant domain, comprising at least part of the hinge, 

CH2, and CH3 regions. It is preferred to have the first heavy-chain constant region (CH1) 

containing the site necessary for light-chain binding present in at least one of the fusions. 

DNAs encoding the immunoglobulin heavy-chain fusions and, if desired, the immunoglobulin 
20 light chain, are inserted into separate expression vectors, and are co-transfected into a suitable 

host organism. For further details of generating bispecific antibodies see, for example, Suresh 

et al. f Methods in Enzvmoloav . 121:210 (1986). 

Heteroconjugate antibodies are also within the scope of the present invention. Heteroconjugate 
25 antibodies are composed of two covalently joined antibodies. Such antibodies have, for 
example, been proposed to target immune system cells to unwanted cells [U.S. Patent No. 
4,676,980], and for treatment of HIV infection [WO 91/00360; WO 92/200373; EP 03089]. It is 
contemplated that the antibodies may be prepared in vitro using known methods in synthetic 
protein chemistry, including those involving crosslinking agents. For example, immunotoxins 
30 may be constructed using a disulfide exchange reaction or by forming a thioether bond. 
Examples of suitable reagents for this purpose include iminothiolate and methyl-4- 
mercaptobutyrimidate and those disclosed, for example, in U.S. Patent No. 4,676,980. 

The anti-HDAC8 protein antibodies of the invention have various utilities. For example, anti- 
35 HDAC8 protein antibodies may be used in diagnostic assays for a HDAC8 protein, e.g., 
detecting its expression in specific cells, tissues, or serum. Various diagnostic assay 
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techniques known in the art may be used, such as competitive binding assays, direct or indirect 
sandwich assays and immunoprecipitation assays conducted in either heterogeneous or 
homogeneous phases [Zola, Monoclonal Antibodies: a Manual of Techniques . CRC Press, Inc. 
(1987) pp. 147-158]. The antibodies used in the diagnostic assays can be labeled with a 
5 detectable moiety. The detectable moiety should be capable of producing, either directly or 
indirectly, a detectable signal. For example, the detectable moiety may be a radioisotope, such 
as 3 H, 14 C, 32 P, 35 S, or 125 l, a fluorescent or chemiluminescent compound, such as fluorescein 
isothiocyanate, rhodamine, or luciferin, or an enzyme, such as alkaline phosphatase, beta- 
galactosidase or horseradish peroxidase. Any method known in the art for conjugating the 
1 0 antibody to the detectable moiety may be employed, including those methods described by 
Hunter et al. a Nature . 144:945 (1962); David et al. t Biochemistry . 13:1014 (1974); Pain et al., J, 
Immunol. Meth. . 40:219 (1981); and Nygren, J. Histochem. and Cvtochem. . 30:407 (1982). 

Anti-HDAC8 protein antibodies also are useful for the affinity purification of HDAC8 protein from 
recombinant cell culture or natural sources. In this process, the antibodies against HDAC8 

1 5 protein are immobilized on a suitable support, such a Sephadex resin or filter paper, using 
methods well known in the art. The immobilized antibody then is contacted with a sample 
containing the HDAC8 protein to be purified, and thereafter the support is washed with a 
suitable solvent that will remove substantially all the material in the sample except the HDAC8 
protein, which is bound to the immobilized antibody. Finally, the support is washed with another 

20 suitable solvent that will release the HDAC8 protein from the antibody. 

The anti-HDAC8 protein antibodies may also be used in treatment. In one embodiment, the 
genes encoding the antibodies are provided, such that the antibodies bind to and modulate the 
HDAC8 protein within the cell. 

In one embodiment, a therapeutically effective dose of an HDAC8 protein, agonist or antagonist 
25 is administered to a patient. By "therapeutically effective dose" herein is meant a dose that 
produces the effects for which it is administered. The exact dose will depend on the purpose of 
the treatment, and will be ascertainable by one skilled in the art using known techniques. As is 
known in the art, adjustments for HDAC8 protein degradation, systemic versus localized 
delivery, as well as the age, body weight, general health, sex, diet, time of administration, drug 
30 interaction and the severity of the condition may be necessary, and will be ascertainable with 
routine experimentation by those skilled in the art. 

A "patient" for the purposes of the present invention includes both humans and other animals, 
particularly mammals, and organisms. Thus the methods are applicable to both human therapy 
and veterinary applications. In the preferred embodiment the patient is a mammal, and in the 
35 most preferred embodiment the patient is human. 
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The administration of the HDAC8 protein, agonist or antagonist of the present invention can be 
done in a variety of ways, including, but not limited to, orally, subcutaneously, intravenously, 
intranasally, transdermal^, intraperitoneally, intramuscularly, intrapulmonary, vaginally, rectally, 
or intraocularly. In some instances, for example, in the treatment of wounds and inflammation, 
5 the composition may be directly applied as a solution or spray. Depending upon the manner of 
introduction, the compounds may be formulated in a variety of ways. The concentration of 
therapeutically active compound in the formulation may vary from about 0.1-100 wt.%. 

The pharmaceutical compositions of the present invention comprise an HDAC8 protein, agonist 
or antagonist (including antibodies and bioactive agents as described herein) in a form suitable 

1 0 for administration to a patient. In the preferred embodiment, the pharmaceutical compositions 
are in a water soluble form, such as being present as pharmaceutical^ acceptable salts, which 
is meant to include both acid and base addition salts. "Pharmaceutically acceptable acid 
addition salt" refers to those salts that retain the biological effectiveness of the free bases and 
that are not biologically or otherwise undesirable, formed with inorganic acids such as 

1 5 hydrochloric acid, hydrobromic acid, sulfuric acid, nitric acid, phosphoric acid and the like, and 
organic acids such as acetic acid, propionic acid, glycolic acid, pyruvic acid, oxalic acid, maleic 
acid, mafonic acid, succinic acid, fumaric acid, tartaric acid, citric acid, benzoic acid, cinnamic 
acid, mandelic acid, methanesulfonic acid, ethanesulfonic acid, p-toluenesuifonic acid, salicylic 
acid and the like. "Pharmaceutically acceptable base addition salts" include those derived from 

20 inorganic bases such as sodium, potassium, lithium, ammonium, calcium, magnesium, iron, 
zinc, copper, manganese, aluminum salts and the like. Particularly preferred are the 
ammonium, potassium, sodium, calcium, and magnesium salts. Salts derived from 
pharmaceutically acceptable organic non-toxic bases include salts of primary, secondary, and 
tertiary amines, substituted amines including naturally occurring substituted amines, cyclic 

25 amines and basic ion exchange resins, such as isopropylamine, trimethylamine, diethylamine, 
triethylamine, tripropylamine, and ethanolamine. 

The pharmaceutical compositions may also include one or more of the following: carrier 
proteins such as serum albumin; buffers; fillers such as microcrystalline cellulose, lactose, corn 
and other starches; binding agents; sweeteners and other flavoring agents; coloring agents; and 
30 polyethylene glycol. Additives are well known in the art, and are used in a variety of 
formulations. 

Combinations of the compositions may be administered. Moreover, the compositions may be 
administered in combination with other therapeutics, including growth factors or 
chemotherapeutics and/or radiation. Targeting agents (i.e. ligands for receptors on cancer 
35 cells) may also be combined with the compositions provided herein. 
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In one embodiment provided herein, the antibodies are used for immunotherapy, thus, methods 
of immunotherapy are provided. By "immunotherapy" is meant treatment of HDAC8 protein 
related disorders with an antibody raised against a HDAC8 protein. As used herein, 
immunotherapy can be passive or active. Passive immunotherapy, as defined herein, is the 
5 passive transfer of antibody to a recipient (patient). Active immunization is the induction of 
antibody and/or T-cell responses in a recipient (patient). Induction of an immune response can 
be the consequence of providing the recipient with an HDAC8 protein antigen to which 
antibodies are raised. As appreciated by one of ordinary skill in the art, the HDAC8 protein 
antigen may be provided by injecting an HDAC8 protein against which antibodies are desired to 
1 0 be raised into a recipient, or contacting the recipient with an HDAC8 protein nucleic acid, 
capable of expressing the HDAC8 protein antigen, under conditions for expression of the 
HDAC8 protein antigen. 

In a preferred embodiment, a therapeutic compound is conjugated to an antibody, preferably an 
HDAC8 protein antibody. The therapeutic compound may be a cytotoxic agent. In this method, 

1 5 targeting the cytotoxic agent to apoptotic cells or tumor tissue or cells, results in a reduction in 
the number of afflicted cells, thereby reducing symptoms associated with apoptosis, cancer 
HDAC8 protein related disorders. Cytotoxic agents are numerous and varied and include, but 
are not limited to, cytotoxic drugs or toxins or active fragments of such toxins. Suitable toxins 
and their corresponding fragments include diptheria A chain, exotoxin A chain, ricin A chain, 

20 abrin A chain, curcin, crotin, phenomycin, enomycin and the like. Cytotoxic agents also include 
radiochemicals made by conjugating radioisotopes to antibodies raised against HDAC8 
proteins, or binding of a radionuclide to a chelating agent that has been covalently attached to 
the antibody. 

In a preferred embodiment, HDAC8 protein genes are administered as DNA immunogens, 
25 either single nucleic acids or combinations of HDAC8 protein genes. Naked DNA immunogens 
are generally known in the art; see Brower, Nature Biotechnology 16:1304-1305 (1998). 
Methods for the use of nucleic acids as DNA immunogens are well known to one of ordinary 
skill in the art, and include placing an HDAC8 protein gene or portion of an HDAC8 protein 
nucleic acid under the control of a promoter for expression in an animal. The HDAC8 protein 
30 gene used for DNA immunogens can encode full-length HDAC8 proteins, but more preferably 
encodes portions of the HDAC8 proteins including peptides derived from the HDAC8 protein. In 
a preferred embodiment a patient is immunized with a DNA immunogen comprising a plurality of 
nucleotide sequences derived from a HDAC8 protein gene. Similarly, it is possible to immunize 
an animals with a plurality of HDAC8 protein genes or portions thereof, as defined herein. 
35 Without being bound by theory, following expression of the polypeptide encoded by the DNA 
immunogen, cytotoxic T-cells, helper T-cells and antibodies are induced which recognize and 
destroy or eliminate cells expressing HDAC8 proteins. 
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The following examples serve to more fully describe the manner of using.the above-described 
invention, as well as to set forth the best modes contemplated for carrying out various aspects 
of the invention. It is understood that these examples in no way serve to limit the true scope of 
this invention, but rather are presented for illustrative purposes. All patents, patent applications, 
5 publications, and references cited herein are expressly incorporated by reference in their 
entirety. Moreover, all sequences displayed, cited by reference or accession number in the 
references are incorporated by reference herein in their entirety. 
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EXAMPLES 
Example 1 
HDAC8 Nucleic Acid 

A search of the GenBank database for expressed sequence tags (ESTs) using Basic BLAST, 
5 which is available through the NCBI website (http://www.ncbi.nlm.nih.gov/blast/blast/cai> 
using the default parameters revealed an EST (40936; accession number AA33308) from a 
uterine library with homology to the amino terminal portion of HDAC3. To obtain a 
full-length-clone, the Marathon-Ready Human Utems cDNA library (Clontech, Palo Alto, CA) 
was screened using the technique of 3-RACE, in which the 3' end of a cDNA is amplified using 
1 0 PCR. The Marathon-Ready cDNA libraries are produced as a library of full length cDNAs 
flanked by adaptor primers. To perform 3* RACE, a PCR reaction is performed using a primer 
homologous to the adaptor region and a gene specific primer. The HDAC8 specific primer was: 
TGCGGAACGGTTTTAAGCGGAG. 



This technique revealed the presence of two transcripts of roughly equal abundance: one of 
15 1 700 bp and one of 800 bp. Sequence analysis of both forms demonstrated that the 800 bp 
transcript is identical to the 5' end of the 1700 bp transcript, and represents the amino terminal 
146 residues. 



HDAC8 protein is homologous to the RPD3 class (I) of HDACs. The HDAC1, HDAC2, and 
HDAC3 proteins contain a short carboxy-terminal extension relative to he predicted 377 amino 
20 acid HDAC8 protein (Figure 2). The percent amino acid identities of HDAC1 , 2, 3, and 8 were 
determined by the MacVector Program (Version 6.5M for Macintosh, Oxford Molecular Group). 
The results are: 



HDAC8 vs HDAC1 
HDAC8 vs HDAC2 
25 HDAC8vsHDAC3 



146/486 identities = 30%; 
151/491 identities = 31%; 
151/439 identities = 34% 



HDAC1 vs HDAC2 
HDAC2 vs HDAC3 
HDAC1 vs HDAC3 



410/488 identities = 84%; 
250/491 identities = 51%; 
252/484 identities = 52%. 



As shown above, HDAC1-3 share between 51% to 84% amino acid sequence identity. In 
30 contrast, the amino acid identity these proteins share with HDAC8 is from about 30% to 34%. 
Thus, all of the above HDAC proteins share sequence identity but HDAC8 is relatively more 
distantly related to the other HDAC proteins with form a more closely, interrelated group. 



55 



WO 01/42437 

We claim: 



CLAIMS 



PCT/US00/33622 



1. A recombinant nucleic acid encoding an HDAC protein comprising a nucleic acid that 
hybridizes under high stringency conditions to a sequence complementary to that set forth in 

5 Figure 1 (SEQ ID NO:1). 

2. The recombinant nucleic acid of claim 1 comprising a nucleic acid sequence as set forth in 
Figure 1 (SEQ ID NO:1). 

3. A recombinant nucleic acid encoding an HDAC protein comprising a nucleic acid having at 
least 85% sequence identity to a sequence as set forth in Figure 1 (SEQ ID NO:1). 

10 4. A recombinant nucleic acid encoding an amino acid sequence as shown in Figure 1 (SEQ ID 
NO:2). 

5. An expression vector comprising the recombinant nucleic acid according to any one of 
claims 1, 2, 3, or 4 operably linked to regulatory sequences recognized by a host cell 
transformed with the nucleic acid. 

15 6. A host cell comprising the recombinant nucleic acid according to any one of claims 1 , 2, 3, 4, 
or 5. 

7. A host cell comprising the vector of claim 6. 

8. A process for producing an HDAC protein comprising culturing the host cell of claim 6 or 7 
under conditions suitable for expression of said HDAC protein. 

20 9. The process according to claim 8 further comprising recovering said protein. 

10. A recombinant HDAC protein encoded by the nucleic acid of any of claims 1, 2, 3, or 4. 

11. A recombinant polypeptide comprising an amino acid sequence having at least 80% 
sequence identity with the sequence set forth in Figure 1 (SEQ ID NO:2). 

12. The recombinant polypeptide of claim 1 1 wherein said sequence is set forth in Figure 2 
25 (SEQIDNO:2). 

13. An isolated polypeptide which specifically binds to an HDAC protein according to claim 12. 
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14. The polypeptide according to claim 13 that is an antibody. 

15. The polypeptide according to claim 14 wherein said antibody is a monoclonal antibody. 

16. The monoclonal antibody of claim 15 wherein said antibody reduces or eliminates the 
biological function of said protein. 

5 17. A method of screening for a bioactive agent capable of binding to an HDAC protein, said 
method comprising: 

a) combining an HDAC8 protein and a candidate bioactive agent; and 

b) determining the binding of said candidate bioactive agent to said HDAC8 protein. 

1 8. A method of screening for a bioactive agent capable of modulating the activity of a HDAC 
1 0 protein, said method comprising: 

a) adding a candidate bioactive agent to a cell comprising a recombinant nucleic acid 
encoding HDAC8 protein; and 

b) determining the effect of said candidate bioactive agent on said cell. 

19. The method according to Claim 20, wherein a library of candidate bioactive agents is added 
15 to a plurality of cells comprising a recombinant nucleic acid encoding a HDAC8 protein. 

20. A method according to claim 21 further comprising adding a labeling agent that will label 
said cells in a specific phase of the cell cycle. 

21 . A method according to claim 20 wherein said cell is a tumor cell. 
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FIGURE 1: 



10 20 30 40 50 

AAGCGGAAG ATGG AGGAGCCGGAGGAACCGGCGGACAGTGGGCAGTCGCT 
TTCGCCTTCTACCTCCTCGGCCTCCTTGGCCGCCTGTCACCCGTCAGCGA 
MEEPEEPA'DSGQSL 

60 70 80 90 100 

GGTCCCGGTTTATATCTATAGTCCCGAGTATGTCAGTATGTGTGACTCCC 
CCAGGGCCAAATATAGATATCAGGGCTCATACAGTCATACACACTGAGGG 
VPVYIYSPEYVSMCDS 

110 120 130 140 150 

TGGCCAAGATCCCCAAACGGGCCAGTATGGTGCATTCTTTGATTGAAGCA 
ACCGGTTCTAGGGGTTTGCCCGGTCATACCACGTAAGAAACTAACTTCGT 
LAKIPKRASMVHSIilEA 

160 170 180 190 200 

TATGCACTGCATAAGCAGATGAGGATAGTTAAGCCTAAAGTGGCCTCCAT 
ATACGTGACGTATTCGTCTACTCCTATCAATTCGGATTTCACCGGAGGTA 
YALHKQMRIVKPKVASM 

210 220 230 240 250 

GGAGGAGATGGCCACCTTCCACACTGATGCTTATCTGCAGCATCTCCAGA 
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CCTCCTCTACCGGTGGAAGGTGTGACTACGAATAGACGTCGTAGAGGTCT 
EEMATFH TDA YLQHIiQ 

260 270 280 290 300 

AGGTCAGCCAAGAGGGCGATGATGATCATCCGGACTCCATAGAATATGGG 
TCCAGTCGGTTCTCCCGCTACTACTAGTAGGCCTGAGGTATCTTATACCC 
KVS QEGDDDHPDSIEYG 

310 320 330 340 350 

CTAGGTTATGACTGCCCAGCCACTGAAGGGATATTTGACTATGCAGCAGC 
GATCCAATACTGACGGGTCGGTGACTTCCCTATAAACTGATACGTCGTCG 
LGYDCPATEGIFDYAAA 

360 370 380 390 400 

TATAGGAGGGGCTACGATCACAGCTGCCCAATGCCTGATTGACGGAATGT 
ATATCCTCCCCGATGCTAGTGTCGACGGGTTACGGACTAACTGCCTTACA 
IGGATITAAQCLIDGM 

410 420 430 440 450 

GCAAAGTAGCAATTAACTGGTCTGGAGGGTGGCATCATGCAAAGAAAGAT 
CGTTTCATCGTTAATTGACCAGACCTCCCACCGTAGTACGTTTCTTTCTA 
CKVAINWSGGWHHAKKD 

460 470 480 490 500 

GAAGCATCTGGTTTTTGTTATCTCAATGATGCTGTCCTGGGAATATTACG 
CTTCGTAGACCAAAAACAATAGAGTTACTACGACAGGACCCTTATAATGC 
EASGFCYIiNDAVIiGI LR 

510 520 530 540 550 

ATTGCGACGGAAATTTGAGCGTATTCTC^ 

TAACGCTGCCTTTAAACTCGCATAAGAGATGCACCTAAACCTAGACGTGG 
LRRKFERILYVDLDIiH 

560 570 580 590 600 

ATGGAGATGGTGTAGAAGACGCATTCAGTTTCACCTCCAAAGTCATGACC 
TACCTCTACCACATCTTCTGCGTAAGTCAAAGTGGAGGTTTCAGTACTGG 
HGDGVEDAF SFTSKVMT 

610 620 630 640 650 

GTGTCCCTGCACAAATTCTCCCCAGGATTTTTCCCAGGAACAGGTGACGT 
CACAGGGACGTGTTTAAGAGGGGTCCTAAAAAGGGTCCTTGTCCACTGCA 
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VSLHKFSPGFFPGTGDV 

660 670 680 690 700 

GTCTGATGTTGGCCTAGGGAAGGGACGGTAGTACAGTGTAAATGTGCCCA 
CAGACTACAACCGGATCCCTTCCCTACCATGATGTCACATTTACACGGGT 
SDVGLGKGRYYSVNVP 

710 720 730 740 750 

TTCAGGATGGCATACAAGATGAAAAATATTACCAGATCTGTGAAAGCGTA 
AAGTCCTACCGTATGTTCTACTTTTTATAATGGTCTAGACACTTTCGCAT 
IQDGIQDEKYYQICESV 

760 770 780 790 800 

CTAAAGGAAGTATACCAAGCCTTTAATCCCAAAGCAGTGGTCTTACAGCT 
GATTTCCTTCATATGGTTCGGAAATTAGGGTTTCGTCACCAGAATGTCGA 
IfKEVYQAFNPKAVVLQL 

810 820 830 840 850 

GGGAGCTGACACAATAGCTGGGGATCCCATGTGCTCCTTTAACATGACTC 
CCCTCGACTGTGTTATCGACCCCTAGGGTACACGAGGAAATTGTACTGAG 
GADTIAGDPMCSFNMT 

860 870 880 890 900 

CAGTGGGAATTGGCAAGTGTCTTAAGTACATCCTTCAATGGCAGTTGGCA 
GTCACCCTTAACCGTTCACAGAATTCATGTAGGAAGTTACCGTCAACCGT 
PVGXGKC LKYXLQWQLA 

910 920 930 940 950 

ACACTCATTTTGGGAGGAGGAGGCTATAACCTTGCCAACACGGCTCGATG 
TGTGAGTAAAACCCTCCTCCTCCGATATTGGAACGGTTGTGCCGAGCTAC 
TLILGGGGYNIiANTARC 

960 970 980 990 1000 

CTGGACATACTTGACCGGGGTCATCCTAGGGAAAACACTATCCTCTGAGA 
GACCTGTATGAACTGGCCCCAGTAGGATCCCTTTTGTGATAGGAGACTCT 
WTYLTGVIDGKTLSSE 

1010 1020 1030 1040 1050 

TCCCAGATCATGAGTTTTTCACAGCATATGGTCCTGATTATGTGCTGGAA 
AGGGTCTAGTACTCAAAAAGTGTCGTATACCAGGACTAATACACGACCTT 
IPDHEFFTAYGPDYVLE 
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1060 1070 1080 1090 1100 

ATCACGCCAAGCTGCCGGCCAGACCGCAATGAGCCCCACCGAATCCAACA 
TAGTGCGGTTCGACGGCCGGTCTGGCGTTACTC^ 
ITPSCRPDRNEPHRIQQ 

1110 1120 1130 1140 1150 

AATCCTCAACTACATCAAAGGGAATCTGAAGCATGTGGTCTAGTTGACAG 
TTAGGAGTTGATGTAGTTTCCCTTAGACTTCGTACACCAGATCAACTGTC 
ILNYIKGNLKHVV* 

1160 1170 1180 1190 1200 

AAAGAGATCAGGTTTCCAGAGCTGAGGAGTGGTGCCTATAATGAAGACAG 
TTTCTCTAGTCCT^GGTCTCGACTCCTCACCACGGATATTACTTCTGTC 

1210 1220 1230 1240 1250 

CGTGTTTATGCAAGCAGTTTGTGGAATTTGTGACT 

GCACAAATACGTTCGTCAAACACCTTAAACACTGACGTCCCTTTTAAACT 

1260 1270 1280 1290 1300 

AAGAAATTACTTCCTGAAAATTTCCAAGGGGCATCAAGTGGCAGCTGGCT 
TTCTTTAATGAAGGACTTTTAAAGGTTCCCCGTAGTTCACCGTCGACCGA 

1310 1320 1330 1340 1350 

TCCTGGGGTGAAGAGGCAGGCACCCCAGAGTCCTCAACTGGACCTAGGGG 
AGGACCCCACTTCTCCGTCCGTGGGGTCTCAGGAGTTGACCTGGATCCCC 

1360 1370 1380 * 1390 1400 

AAGAAGGAGATATCCCACATTTAAAGTTCTTATTTAAAAAAAACACACAC 
TTCTTCCTCTATAGGGTGTAAATTTCAAGAATAAATTTTTTTTC 

1410 1420 1430 1440 1450 

ACACAAATGAAATTTTTAATCTTTGAAAATTA 

TGTGTTTACTTTAAAAATTAGAAAC TTTTAATAAAAATTCGCTTAACCCC 

1460 1470 1480 1490 1500 

AGGGGAGTATTTTAATCATCTTAAATGAAACAGATCAGAAGCTGGATGAG 
TCCCCTCATAAAATTAGTAGAATTTACTTTGTCTAGTCTTCGACCTACTC 

1510 1520 1530 1540 1550 

AGCAGTCACCAGTTTGTAGGGCAGGAGGCAGCTGACAGGCAGGGTTTGGG 
TCGTCAGTGGTCAAACATCCCGTCCTCCGTCGACTGTCCGTCCCAAACCC 
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1560 1570 1580 1590 1600 

CCTCAGGACCATCCAGGTGGAGCCCTGGGAGAGAGGGTACTGATCAGCAG 
GGAGTCCTGGTAGGTCCACCTCGGGACCCTCTCTCCCATGACTAGTCGTC 

1610 1620 1630 1640 1650 

ACTGGGAGGTGGGGAGAAGTCCGCTGGTGTTGTTTTAGTGTTATATATCT 
TGACCCTCCACCCCTCTTCAGGCGACCACAACAAAATCACAATATATAGA 

1660 1670 1680 

TTGGTTTTTTTAATAAAATCTTTGAAAACCTA 
AACCAAAAAAATTATTTTAGAAACTTTTGGAT 
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BEST AVAILABLE COP* 



